(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Int Hectual Property Organization 
International Bureau 

(43) International Publication Date 
28 December 2000 (28.12.2000) 




PCT 



IlllDIDIIlfllfllOIOlIillOHliai 

(10) International Publication Number 

WO 00/78953 A2 



(51) International Patent Classification 7 : C12N 15/12, 
5/10, C07K 14/47, 14/705, 16/18, 16/28, C12Q 1/68, 
A61K 38/17, GOIN 33/50, A01K 67/027 

(21) international Application Number: PCI7US00/ 16668 

(22) International Filing Date: 1 6 June 2000 (1 6.06.2000) 



(25) Filing Language: 



(26) Publication Language: 



English 



English 



(30) 



Priority Data: 

60/139,923 

60/148,177 

60/149,357 

60/162,287 



1 7 June 1 999 (1 7.06. 1999) US 

10 August 1999 (10.08. 1999) US 

18 August 1999 (18.08.1999) US 

28 October 1999 (28.10.1999) US 



(63) 



Related by continuation (CON) or continuation-in-part 
(CIP) to earlier applications: 

US 60/139,923 (CIP) 

Filed on 17 June 1999 (17.06.1999) 

US 60/148,177 (CIP) 

Filed on 10 August 1999 (10.08.1999) 

US 60/149,357 (CIP) 

Filed on 18 August 1999 (18.08.1999) 

US 60/162 T 287 (CIP) 

Filed on 28 October 1999 (28.10.1999) 



(71) Applicant (for all designated States except US): INCYTE 
GENOMICS, INC. [US/US]; 3160 Porter Drive, Palo 
Alto, CA 94304 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): LAL, Preeti 
[IN/US]; 2382 Lass Drive, Santa Clara, CA 95054 (US). 
YANG, Junming [CN/US]; 7125 Bark Lane, San Jose, 
CA 95129 (US). YUE, Henry [US/US]; 826 Lois Avenue, 
Sunnyvale, CA 94087 (US). HELLMAN, Jennifer, L. 



[US/US]; 230 Monroe Drive, #12, Mountain View, CA 
94040 (US). TANG, Y. t Tom [CN/US]; 4230 Ranwick 
Court, San Jose, CA 95118 (US). BANDMAN, Olga 
[US/US]; 366 Anna Avenue, Mountain View, CA 94043 
(US). BURFORD, Neil [GB/US]; 1308 4th Avenue, 
San Francisco, CA 94122 (US). BAUGHN, Marian, R. 
[US/US]; 14244 Santiago Road, San Leandro, CA 94577 
(US). AZIMZAI, Yalda [US/US]; 2045 Rocksprings 
Drive, Hayward, CA 94545 (US). LU, Dyung, Aina, M. 
[US/US]; 55 Park Belmont Place, San Jose, CA 95136 
(US). AU-YOUNG, Janice [USAJS]; 233 Golden Eagle 
Lane, Brisbane, CA 94005 (US). PATTERSON, Chandra 
[US/US]; 490 Sherwood Way, #1, Menlo Park, CA 94025 
(US). 

(74) Agents: HAMLET-COX, Diana et al.; Incyte Genomics, 
Inc., 3160 Porter Drive, Palo Alto, CA 94304 (US). 

(81) Designated States (national): AE, AL, AM, AT, AU, AZ, 
BA, BB, BG, BR, BY, CA, CH, CN, CR, CU, CZ, DE, DK, 
DM, EE, ES, FI, GB, GD, GE, GH, GM, HR, HU, ID, IL, 
IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU, 
LV, MA, MD, MG, MK, MN, MW, MX, NO, NZ, PL, PT, 
RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, UA, 
UG, US, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, 
CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— Without international search report and to be republished 
upon receipt of that report. 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



< 

m 
oo 



(54) Title: HUMAN TRANSPORT PROTEINS 



(57) Abstract: The invention provides human transport proteins (TPPT) and polynucleotides which identify and encode TPPT. The 
invention also provides expression vectors, host cells, antibodies, agonists, and antagonists. The invention also provides methods for 
diagnosing, treating, or preventing disorders associated with expression of TPPT. 



oiSOOCO: <WO 0078953A2 I > 



WO 00/78953 



PCT/US00/16668 



HUMAN TRANSPORT PROTEINS 

TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of human transport proteins 
and to the use of these sequences in the diagnosis, treatment, and prevention of transport, metabolic, 
neurological, cardiovascular, reproductive, and immune disorders, and cell proliferative disorders 
including cancer- — 



BACKGROUND OF THE INVENTION 

Eukaryotic cells are surrounded and subdivided into functionally distinct organelles by 
hydrophobic lipid bilayer membranes. These membranes act as a barrier to most molecules, and 
maintain the essential differences between the cytosol, the extracellular environment, and the contents 
of each intracellular organelle. Transport of essential nutrients, certain metal ions, metabolic waste 
products, cell signaling molecules, macromolecules, and proteins across lipid membranes and 
between organelles must be mediated by a variety of transport molecules. Transport between the 
cytoplasm and the extracellular environment, and between the cytoplasm and lumenal spaces of 
cellular organelles requires specific transport proteins. Each transport protein carries a particular 
class of molecule, such as ions, sugars, or amino acids, and often is specific to a certain molecular 
species of the class. 

Cells and organelles require transport molecules to import and export essential nutrients and 
metal ions including K\ NH 4 *. S0 4 2 \ sugars, and vitamins, as well as various metabolic waste 
products. Transport proteins also play roles in antibiotic resistance, toxin secretion, ion balance, 
synaptic neurotransmission, kidney function, intestinal absorption, tumor growth, and other diverse 
cell functions (Griffith, J. and C. Sansom (1998) The Transporter Facts Book . Academic Press, San 
Diego CA, pp. 3-29). Transport can occur by a passive concentration-dependent mechanism, or can 
be linked to an energy source such as ATP hydrolysis or an ion gradient. Proteins that function in 
transport include carrier proteins, which bind to a specific solute and undergo a conformational 
change that transfers the bound solute across the membrane, and channel proteins, which form 
hydrophilic pores that allow specific solutes to diffuse through the membrane down an 
electrochemical solute gradient. 

Transport proteins are often multi-pass transmembrane proteins, which either actively 
transport molecules across the membrane or passively allow them to cross. Active transport involves 
directional pumping of a solute across the membrane, usually against an electrochemical gradient. 
Active transport is tightly coupled to a source of metabolic energy, such as ATP hydrolysis or an 
electrochemically favorable ion gradient. Passive transport involves the movement of a solute down 
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its electrochemical gradient. Transport proteins can be further classified as either carrier proteins or 
channel proteins. Carrier proteins, which can function in active or passive transport, bind to a 
specific. solute to be transported and undergo a conformational change which transfers the bound • 
solute across the membrane. Channel proteins, which only function in passive transport, form 
hydrophilic pores across the membrane. When the pores open, specific solutes, such as inorganic 
ions, pass through the membrane and down the electrochemical gradient of the solute. Examples 
include facilitative transporters, the secondary active symporters and antiporters driven by ion 
gradients, and active ATP binding cassette transporters involved in multiple-drug resistance and 
targeting of antigenic peptides to MHC Class I molecules. Transported substrates range from 
nutrients and ions to a broad variety of drugs, peptides and proteins. 

Information on the action of ARL-6 (ADP-ribosylation like factor), an endoplasmic reticulum 
transmembrane protein, can be found in Greenfield, J.J. and S. High (1999; J. Cell Sci. 1 12:1477- 
1486). Information on reduced folate carrier transporter proteins can be found in Dixon, K.H. et al. 
(1994; J. Biol. Chem. 269: 17-20) and Moscow, J.A. et al. (1995; Cancer Res. 55:5983-5987). 

Carrier proteins which transport a single solute from one side of the membrane to the other 
are called uniporters. In contrast, coupled transporters link the transfer of one solute with 
simultaneous or sequential transfer of a second solute, either in the same direction (symport) or in the 
opposite direction (antiport). For example, intestinal and kidney epithelia contain a variety of 
symporter systems driven by the sodium gradient that exists across the plasma membrane. Sodium 
moves into the cell down its electrochemical gradient and brings the solute into the cell with it. The 
sodium gradient that provides the driving force for solute uptake is maintained by the ubiquitous 
Na7K + ATPase. Sodium-coupled transporters include the mammalian glucose transporter (SGLT1), 
iodide transporter (NIS), and multivitamin transporter (SMVT). All three transporters have twelve 
putative transmembrane segments, extracellular glycosylation sites, and cytoplasmically-oriented N- 
and C-termini. 

Mitochondrial carrier proteins are transmembrane-spanning proteins which transport ions and 
charged metabolites between the cytosol and the mitochondrial matrix. Examples include the ADP, 
ATP carrier protein; the 2-oxoglutarate/malate carrier; the phosphate carrier protein; the brown fat 
uncoupling protein which transports protons from the cytosol into the matrix; the pyruvate carrier, the 
dicarboxylate carrier which transports malate, succinate, fumarate, and phosphate; the tricarboxylate 
carrier which transports citrate and malate; and the Grave's disease carrier protein, a protein 
recognized by IgG in patients with active Grave's disease, an autoimmune disorder resulting in 
hyperthyroidism (Stryer, L. (1995) Biochemistry . W.H. Freeman and Company, New York NY, p. 
551; PROSITE PDOC00I89 Mitochondrial energy transfer proteins signature; Online Mendelian 
Inheritance in Man (OMIM) *275000 Graves Disease). 

This class of transporters also includes the mitochondrial uncoupling proteins, which create 
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proton leaks across the inner mitochondrial membrane, thus uncoupling oxidative phosphorylation 
from ATP synthesis. The result is energy dissipation in the form of heat. Mitochondrial uncoupling 
proteins have been implicated as modulators of thermoregulation and metabolic rate, and have been 
proposed as potential targets for drugs against metabolic diseases such as obesity (Ricquier, D. et al. 
5 (1999) J. Int. Med. 245:637-642). 

A number of metal ions such as iron, zinc, copper, cobalt, manganese, molybdenum, 
selenium, nickel, and chromium are important as cofactors for a number of enzymes. For example, 
^inc is required for the function of enzymes such as the extracellular matrix metal loproteinases, and 
zinc ions stabilize several motifs commonly found in transcription factors, including zinc fingers, zinc 
10 clusters, and LIM domains. Zinc and other metal ions must be provided in the diet, and are absorbed 
by transporters in the gastrointestinal tract. Plasma proteins transport the metal ions to the liver and 
other target organs, where specific transporters move the ions into cells and cellular organelles as 
needed. Imbalances in metal ion metabolism have been associated with a number of disease states 
(Danks, D.M. (1986) J. Med. Genet. 23:99-106). 
15 The largest and most diverse family of transport proteins known are the ATP-binding cassette 

(ABC) transporters. As a family, ABC transporters can transport substances that differ markedly in 
chemical structure and size, ranging from small molecules such as ions, sugars, amino acids, peptides, 
and phospholipids, to lipopeptides, large proteins, and complex hydrophobic drugs. ABC proteins 
consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze ATP to supply the 
20 energy required for transport, and two membrane-spanning domains (MSD), each containing six 
putative transmembrane segments. These four modules may be encoded by a single gene, as is the 
case for the cystic fibrosis transmembrane regulator (CFTR), or by separate genes. When encoded by 
separate genes, each gene product contains a single NBD and MSD. These "half-molecules" form 
homo- and heterodimers, such as Tapl and Tap2, the endoplasmic reticulum-based major 
25 histocompatibility (MHC) peptide transport system. Several genetic diseases are attributed to defects 
in ABC transporters, such as the following diseases and their corresponding proteins: cystic fibrosis 
(CFTR, an ion channel), adrenoleukodystrophy (adrenoleukodystrophy protein, ALDP), Zellweger 
syndrome (peroxisomal membrane protein-70, PMP70), and hyperinsulinemic hypoglycemia 
(sulfonylurea receptor, SUR). Overexpression of the multidrug resistance (MDR) protein, another 
30 ABC transporter, in human cancer cells makes the cells resistant to a variety of cytotoxic drugs used 
in chemotherapy (Taglicht, D. and S. Michaeiis (1998) Methods Enzymol. 292:131-163). 

The nuclear pore complex (NPC) is a large multiprotein complex spanning the nuclear 
envelope which mediates the transport of proteins and RNA molecules between the nucleus and the 
cytoplasm, thus contributing to the regulation of gene expression. The NPC allows passive diffusion 
35 of ions, small molecules, and macromolecules under about 60kD, while larger macromolecules are 

transported by facilitated, energy-dependent pathways. Nuclear localization signals (NLS), consisting 
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of short stretches of amino acids enriched in basic residues, are found on proteins that are targeted to 
the nucleus, such as the glucocorticoid receptor. The NLS is recognized by the NLS receptor, 
importin, which then interacts with the monomelic GTP-binding protein Ran. This NLS 
protein/receptor/Ran complex navigates the nuclear pore with the help of the homodimeric protein 
5 nuclear transport factor 2 (NTF2) (Nakielny, S. and G. Dreyfuss (1997) Curr. Opin. Cell Biol. 9:420- 
429; Gorlich, D. (1997) Curr. Opin. Cell Biol. 9:412-419). Four O-linked glycoproteins, p62, p58, 
p54, and p45, exist as a stable "p62 complex" that forms a ring localized on both nucleoplasm^ and 
cytoplasmic surfaces of the NPC. The p62, p58, and p54 proteins all interact directly with the 
cytosolic transport factors p97 and NTF2, suggesting that the p62 complex is an important ligand 
10 binding site near the central gated channel of the NPC (Hu, T. et ai. (1996) J. Cell Biol. 134:589-601). 

Transport can also occur through intercellular bridges which connect the cytoplasms of sister 
cells, for example in the male and female germline of species ranging from fruit flies to humans. 
These bridges allow passage of cytoplasmic materials between cells during development. 
Intercellular bridges have also been found to connect somatic cells. The nurse cells and oocyte of a 
15 Drosophila egg chamber, which are derived from a single precursor cell through four rounds of 

mitosis, are connected to each other through intercellular bridges called ring canals. The cells do not 
completely separate after mitosis; the mitotic cleavage furrows are transformed into ring canals by the 
addition of an actin cytoskeleton lining the tunnels between the ceils. The Drosophila keich protein 
functions in organizing actin in the ring canal. Mutations in kelch cause female sterility in 
20 Drosophila. Kelch contains four protein domains: the NTR domain at the N-terminus, the BTB or 
POZ domain, the IVR or intervening region; and the kelch repeat domain, which contains six 50- 
amino acid kelch repeats. The BTB or POZ domain, a 120-amino acid motif that is also found in 
several zinc-finger containing transcription factors, may be important in dimerization of kelch. Kelch 
repeats are found in other proteins as well and may be important for actin binding (Robinson, D.N. 
25 and L. Cooley (1997) J. Cell Biol. 138:799-810; Cooley, L. (1998) Cell 93:913-915). 

Ion Channels 

The electrical potential of a cell is generated and maintained by controlling the movement of 
ions across the plasma membrane. The movement of ions requires ion channels, which form an ion- 

30 selective pore within the membrane. Ion channels share common structural and mechanistic themes. 
The channel consists of four or five subunits or protein monomers that are arranged like a barrel in 
the plasma membrane. Each subunit typically consists of six potential transmembrane segments (SI, 
S2, S3, S4, S5, and S6). The center of the barrel forms a pore lined by cc-helices or P-strands. The 
side chains of the amino acid residues comprising the cc-helices or P-strands establish the charge 

35 (cation or anion) selectivity of the channel. The degree of selectivity, or what specific ions are 

allowed to pass through the channel, depends on the diameter of the narrowest part of the pore. There 
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are two basic types of ion channels, ion transporters and gated ion channels. Ion transporters utilize 
the energy obtained from ATP hydrolysis to actively transport an ion against the ion's concentration 
gradient. Gated ion channels allow passive flow of an ion down the ion's electrochemical gradient 
under restricted conditions. Together, these types of ion channels generate, maintain, and utilize an 
electrochemical gradient that is used in 1) electrical impulse conduction down the axon of a nerve 
cell, 2) transport of molecules into cells against concentration gradients, 3) initiation of muscle 
contraction, and 4) endocrine cell secretion. 

Tnmsmembi^e-ATPases~^ 
transporters, including Na + -K + ATPase, Ca 2+ ATPase, H + ATPase, and Cu~ ATPase, are activated by 
a phosphorylation event. P-class ion transporters are responsible for maintaining resting potential 
distributions such that cytosolic concentrations of Na + and Ca 2+ are low and cytosolic concentration 
of K + is high. The vacuolar (V) class of ion transporters include H + pumps on intracellular 
organelles, such as lysosomes and Golgi. V-class ion transporters are responsible for generating the 
low pH within the lumen of these organelles that is required for function. The coupling factor (F) 
class consists of H + pumps in the mitochondria. F-class ion transporters utilize a proton gradient to 
generate ATP from ADP and inorganic phosphate (P s ). 

Cu~ ATPases export copper from cells (PROSITE PDOC00139 E1-E2 ATPases 
phosphorylation site). Mutations in one Cu~ ATPase cause Wilson disease, in which toxic amounts 
of copper accumulate in a number of organs, particularly the liver and brain (Tanzi, R.E. et al. (1993) 
Nat. Genet. 5:344-350). Mutations in another Cu^ ATPase cause Menkes disease and occipital horn 
syndrome. Menkes disease mutations block export of copper from the gastrointestinal tract, leading 
to skeletal abnormalities, severe mental retardation, neurologic degeneration, and mortality in early 
childhood (Harrison, M.D. and C.T. Dameron (1999) J. Biochem. Mol. Toxicol. 13:93-106). 
Occipital horn syndrome mutations cause connective tissue defects (Harrison, supra ; Levinson, B. et 
al. (1996) Hum. Mol. Genet. 5:1737-1742). 

The coupling factor (F) class of ion transporters consists of FT pumps in mitochondria, 
chloroplasts, and bacteria. For example, the FqF, ATPase utilizes a proton gradient across the inner 
mitochondrial membrane to generate ATP from ADP and inorganic phosphate (P>). The F 0 F t ATPase 
is composed of the F 0 complex, which is the transmembrane channel through which protons flow, and 
the F, complex, where ATP synthesis activity resides. F 0 has three subunits, A (also known as protein 
6), B, and C (Lodish, H. et al. (1995) Molecular Cell Biology . Scientific American Books, New York 
NY, pp. 752-756; PROSITE PDOC00420 ATP synthase a subunit signature). 

Voltage-gated Ca 2+ channels are involved in presynaptic neurotransmitter release, and heart 
and skeletal muscle contraction. The voltage-gated Ca 2 * channels from skeletal muscle (L-type) and 
brain (N-type) have been purified and, though their functions differ dramatically, they have similar 
subunit compositions. The channels are composed of three subunits. The a, subunit forms the 
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membrane pore and voltage sensor, while the cu5 and p subunits modulate the voltage-dependence, 
gating properties, and the current amplitude of the channel. These subunits are encoded by at least 
six a„ one 0,8, and four p genes. A fourth subunit, y, has been identified. in skeletal muscle-(Walker,-. 
D. et al. (1998) J. Biol. Chem. 273:2361-2367; and Jay, S.D. et al. (1990) Science 248:490-492). The 
human 04 subunit is homologous to the mouse epilepsy gene lethargic , and is a candidate for 
involvement in neurological disorders including ataxia and absence epilepsy (Escayg, A. et al. (1998) 
Genomics 50:14-22). 

Ligand-gated channels open their pores when an extracellular or intracellular mediator binds 
to the channel. Neurotransmitter-gated channels are channels that open when a neurotransmitter 
binds to their extracellular domain. These channels exist in the postsynaptic membrane of nerve or 
muscle cells. There are two types of neurotransmitter-gated channels. Sodium channels open in 
response to excitatory neurotransmitters, such as acetylcholine, glutamate, and serotonin. This 
opening causes an influx of Na + and produces the initial localized depolarization that activates the 
voltage-gated channels and starts the action potential. Chloride channels open in response to 
inhibitory neurotransmitters, such as y-aminobutyric acid (GABA) and glycine, leading to 
hyperpolarization of the membrane and the subsequent generation of an action potential. 

Ion channels are expressed in a number of tissues where they are implicated in a variety of 
processes. CNG channels, while abundantly expressed in photoreceptor and olfactory sensory cells, 
are also found in kidney, lung, pineal, retinal ganglion cells, testis, aorta, and brain. Calcium-activated 
K + channels may be responsible for the vasodilatory effects of bradykinin in the kidney and for 
shunting excess K + from brain capillary endothelial cells into the blood. They are also implicated in 
repolarizing granulocytes after agonist-stimulated depolarization (Ishi, T.M. et al. (1997) Proc. Natl. 
Acad. Sci. USA 94: 1 1651-1 1656). Another transmembrane protein, the leukotrine B4 receptor (BLT) 
appears to be involved in inflammation responses and host cell defense against infection. BLT also 
functions as an HIV coreceptor (Izumi, T. et al. (1997) Nature 387:620-624; Martin, V. et al. (1999) 
J. Biol. Chem. 274:8597-8603). 

Ion channels have been the target for many drug therapies. Neurotransmitter-gated channels 
have been targeted in therapies for treatment of insomnia, anxiety, depression, and schizophrenia. 
Voltage-gated channels have been targeted in therapies for arrhythmia, ischemic stroke, head trauma, 
and neurodegenerative disease (Taylor, CP. and L.S. Narasimhan ( 1997) Adv. Pharmacol. 39:47-98). 

K + channels are located in all cell types, and may be regulated by voltage, ATP 
concentration, or second messengers such as Ca~ and cAMP. In non-excitable tissue, K + channels 
are involved in protein synthesis, control of endocrine secretions, and the maintenance of osmotic 
equilibrium across membranes. In neurons and other excitable cells, in addition to regulating action 
potentials and repolarizing membranes, channels are responsible for setting resting membrane 
potential. The cytosol contains non-diffusible anions and, to balance this net negative charge, the cell 
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contains a Na + -K* pump and ion channels that provide the redistribution of Na + , K + , and CI". The 
pump actively transports Na + out of the cell and K + into the cell in a 3:2 ratio. Ion channels in the 
plasma membrane allow K + and CI * to flow by passive diffusion. Because of the high negative charge 
within the cytosol, CI " flows out of the cell. The flow of K + is balanced by an electromotive force 
5 pulling K + into the cell, and a K + concentration gradient pushing K + out of the cell. Thus, the resting 
membrane potential is primarily regulated by K + flow (Salkoff, L. and T. Jegla (1995) Neuron 15:489- 
492). Information on NY-REN-45, a K+ channel integral membrane protein, can be found in 

ScanlanrM7Jret alr(1998rInt7"JrCancer 767652-65 8 jTThe emopamil-binding protein (EBP) shares 

structural features with both pro- and eukaryotic drug transport proteins (Hanner, M. et al. (1995) J. 
10 Biol. Chem. 270:7551-7557). The Na+ channel, transmembrane protein myelin protein zero (MPZ) 
may be responsible for some sporadic cases of Dejerine-Scottas disease (hereditary motor and 
sensory neuropathy type HI) (Hayasaka, K. et al. (1993) Nat. Genet. 5:266-268). 

K + pore-forming subunits generally have six transmembrane-spanning domains with a short 
region between the fifth and sixth transmembrane regions that senses membrane potential; and the 
15 amino and carboxy termini are located intracellularly. In mammalian heart, the duration of 

ventricular action potential is controlled by a K + current. Thus, the K + channel is central to the 
control of heart rate and rhythm. channel dysfunctions are associated with a number of renal 
diseases including hypertension, hypokalemia, and the associated Bartter's syndrome and Getelman's 
syndrome, as well as neurological disorders including epilepsy. K + channels have been implicated in 
20 Alzheimer's disease by observations that a significant component of senile plaques, beta amyloid or 
A beta, also blocks voltage-gated potassium channels in hippocampal neurons (Antes, L.M. et al. 
(1998) Seminar Nephrol. 18:31-45; Stoffel, M. and L.Y. Jan (1998) Nat. Genet. 18:6-8; Madeja, M. 
et al. (1997) Eur. J. Neurosci. 9:390-395; Good, T.A. et al. (1996) Biophys. J. 70:296-304). 

Gated ion channels control ion flow by regulating the opening and closing of pores. These 
25 channels are categorized according to the manner of regulating the gating function. Mechanically- 
gated channels open pores in response to mechanical stress, voltage-gated channels open pores in 
response to changes in membrane potential, and ligand-gated channels open pores in the presence of a 
specific ion, nucleotide, or neurotransmitter. 

Voltage-gated Na + channels are responsible for electrical excitability of neurons, skeletal 
30 muscle, heart, and neuroendocrine tissues. For example, the sequential opening and closing of 

voltage-gated Na + channels results in the propagation of action potentials down neuronal axons. Na + 
channels isolated from rat brain tissue are heterotrimeric complexes composed of a 260 kDa pore 
forming a subunit that associates with two smaller auxiliary subunits, pi and P2. The p2 subunit is 
an integral membrane glycoprotein that contains an extracellular Ig domain, and its association with a 
35 and pi subunits correlates with increased function of the channel, a change in the channel's gating 

properties, as well as an increase in whole cell capacitance (Isom, L.L. et al. (1995) Cell 83:433-442). 
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Integral Membrane Proteins 

The majority of known integral membrane proteins are transmembrane proteins (TM) which 
are characterized by an extracellular, a transmembrane, and an intracellular domain. TM domains are 
5 typically comprised of 15 to 25 hydrophobic amino acids which are predicted to adopt an a-helical 
conformation. TM proteins are classified as bitopic (Types I and II) and poly topic (Types III and IV) 
(Singer, SJ. (1990) Annu. Rev. Cell Biol. 6:247-96). Bitopic proteins span the membrane once while 
poly topic proteins contain multiple membrane-spanning segments. TM proteins that act as cell- 
surface receptor proteins involved in signal transduction include growth and differentiation factor 
10 receptors, and receptor-interacting proteins such as Drosophiia pecanex and frizzled proteins, LiV-1 
protein, NF2 protein, and GNS1/SUR4 eukaryotic integral membrane proteins. TM proteins also act 
as transporters of ions or metabolites, such as gap junction channels (connexins) and ion channels, 
and as cell anchoring proteins, such as lectins, integrins, and fibronectins. TM proteins act as vesicle 
organelle-forming molecules, such as calveolins, or as cell recognition molecules, such as cluster of 
15 differentiation (CD) antigens, glycoproteins, and mucins. Information on connexin can be found in 
Kanter, H.L. et al. (1994; J. Mol. Cell. Cardiol. 26:861-868). 

Many membrane proteins (MPs) contain amino acid sequence motifs that target these proteins 
to specific subcellular sites. Examples of these motifs include PDZ domains, KDEL, RGD, NGR, and 
GSL sequence motifs, von Willebrand factor A (vWFA) domains, and EGF-Iike domains. RGD, 
20 NGR, and GSL motif containing peptides have been used as drug delivery agents in cancer treatments 
which target tumor vasculature (Arap, W. et al. (1998) Science, 279:377-380.) Furthermore, MPs 
may also contain amino acid sequence motifs, such as the carbohydrate recognition domain (CRD), 
also known as the C-type lectin domain, that mediate interactions with extracellular or intracellular 
molecules. 

25 G-protein coupled receptors (GPCR) comprise a superfamily of integral membrane proteins 

which transduce extracellular signals. GPCRs include receptors for biogenic amines, lipid mediators 
of inflammation, peptide hormones, and sensory signal mediators. The structure of these 
highly-conserved receptors consists of seven hydrophobic transmembrane regions, an extracellular 
N-terminus, and a cytoplasmic C-terminus. Three extracellular loops alternate with three intracellular 

30 loops to link the seven transmembrane regions. The most conserved parts of these proteins are the 
transmembrane regions and the first two cytoplasmic loops. Cysteine disulfide bridges connect the 
second and third extracellular loops. A conserved, acidic-Arg-aromatic residue triplet present in the 
second cytoplasmic loop may interact with G proteins. A GPCR consensus pattern is characteristic of 
most proteins belonging to this superfamily (ExPASy PROSITE document PS00237; and Watson, 

35 S. and S. Arkinstall (1994) The G-protein Linked Receptor Facts Book . Academic Press, San Diego 
CA. pp 2-6). Mutations and changes in transcriptional activation of GPCR-encoding genes have been 

8 
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associated with neurological disorders such as schizophrenia, Parkinson's disease, Alzheimer's 
disease, drug addiction, and feeding disorders. 

Cytochromes are electron-transferring proteins that contain a heme prosthetic group, a 
porphyrin ring containing a tightly bound iron atom. Cytochromes act as oxidoreductases in such 
5 diverse cellular processes as respiration, photosynthesis, fatty acid metabolism, and neurotransmitter 
biosynthesis. The heme iron atom serves as the actual electron carrier by changing from the ferric to 
the ferrous oxidation state when accepting an electron. Cytochromes accept electrons from one 
substrate such-as NADH or ascorbate and donate them to other electron carriers such as other 
cytochromes, ubiquinone, or semidehydroascorbic acid (Lodish, H. et al. (1995) Molecular Cell 
10 - Biology, Scientific American Books, New York NY, pp. 759-770, 786-797; Sperling, P. et al. (1995) 
Eur. J. Biochem. 232:798-805; and Online Mendelian Inheritance in Man (OMIM) *600019 
Cytochrome b561, CYB561). 

Cytochrome b5 is an electron donor in membrane-linked redox enzyme systems involved in 
lipid and drug metabolism. Cytochrome b5 has been found in Golgi, plasma, outer mitochondrial, 
15 endoplasmic reticulum (ER), and microbody membranes. Conserved amino acids in cytochrome b5 
include eight invariant amino acids at W34, H51, P52, G53, G54, G63, F70, and H74, and fifteen 
conserved amino acids at L24, 135, S36, V41, Y42, N43, T45, W47, A48, L58, D65, T67, L85, T87, 
and G88 (numbering based on the sunflower cytochrome b5/delta-6 desaturase fusion protein; GI 
1040729, Sperling, supra). The invariant residues H51PGG are involved in heme-binding. 
20 Cytochrome b5-like domains have also been found linked to other enzymes. For example, 

cytochrome b5-like domains are part of delta-9 fatty acid desaturases in yeast and Histoplasma 
capsulatum, nitrate reductase, sulfite reductase, flavocytochrome b2, Arabidopsis thaliana acyl lipid 
desaturase, and Borago officinalis (borage) and Helianthus annuus (sunflower) delta-6 desaturases 
(Sperling, supra; Sayanova, O. et al (1997) Proc. Natl. Acad. Sci. USA 94:421 1-4216; and Mitchell, 
25 A.G. and C.E. Martin (1997) J. Biol. Chem. 272:28281-28288). 

Signal peptides are found on proteins that are targeted to the endoplasmic reticulum (ER). 
Signal peptides consist of stretches of amino acids enriched in hydrophobic residues. Signal peptides 
are usually found at the extreme N-terminus of the protein and are recognized by a cytosolic signal- 
recognition peptide (SRP). The SRP binds to the signal peptide and to an SRP receptor, an integral 
30 membrane protein in the ER. Once bound to the SRP receptor, the newly formed protein containing 
the signal peptide is translocated across the ER membrane. Proteins containing signal peptides may 
end up inserted into the lipid bilayer, or they may end up in the lumen of an organelle or secreted from 
the cell. 

35 Disease Correlation 

The etiology of numerous human diseases and disorders can be attributed to defects in the 
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transport of molecules across membranes. Defects in the trafficking of membrane-bound transporters 
and ion channels are associated with several disorders, e.g. cystic fibrosis, glucose-galactose 
malabsorption syndrome, hypercholesterolemia, von Gierke disease, and certain forms of-diabetes 
mellitus. Single-gene defect diseases resulting in an inability to transport small molecules across 
5 membranes include, e.g., cystinuria, iminoglycinuria, Haitup disease, and Fanconi disease (van\ Hoff, 
W.G. (1996) Exp. Nephrol. 4:253-262; Talente, G.M. et al. (1994) Ann. Intern. Med. 120:218-226; 
and Chillon, M. et al. (1995) New Engl. J. Med. 332: 1475-1480). 

Cystinuria is an inherited disease that results from the inability to transport cystine, the 
disulfide-Iinked dimer of cysteine, from the urine into the blood. Accumulation of cystine in the 
10 urine leads to the formation of cystine stones in the kidneys. 

Transthyretin (TTR), present in human plasma, binds to and transports the thyroid hormone 
thyroxine. Mutations in TTR result in the conversion of TTR to amyloid, an insoluble fibrillar 
structure. The resulting amyloid plaques have been shown to be the causative agent in the 
development of familial amyloid polyneuropathy and senile systemic amyloidosis (Miroy, G.J. et al. 
15 (1996) Proc. Natl. Acad. Sci. USA 93:15051-15056). 

Stomatin, a 31-kDa erythrocyte integral membrane protein has been linked to the hereditary 
anemia stomatocytosis. This anemia is characterized by red blood cells that lack stomatin and leak 
Na+ and K+. Thus, stomatin is presumed to play a role in the regulation of ion transport. Red blood 
cell ion transport defects are also linked to other disorders such as hypertension (Stewart, G.W. 
20 (1997) Int. J. Biochem. Cell Biol. 29:271-274). 

The discovery of new human transport proteins and the polynucleotides encoding them 
satisfies a need in the art by providing new compositions which are useful in the diagnosis, 
prevention, and treatment of transport, metabolic, neurological, cardiovascular, reproductive, and 
25 immune disorders, and cell proliferative disorders including cancer. 

SUMMARY OF THE INVENTION 

The invention features purified polypeptides, human transport proteins, referred to 
collectively as *TPPT" and individually as "TPFT-l," 'TPPT-2," "TPPT-3/ 1 "TPPT-4," "TPPT-5 " 
'TPPT-6 " "TPPT-7," "TPPT-8," "TPPT-9," 'TPPT-10," "TPPT-1 1," "TPPT-12 "TPPT-13," 
-TPPT-14," "TPPT-15," "TPPT-16," "TPPT-17," "TPPT-18," "TPPT-19," "TPPT-20," "TPPT-21," 
"TPPT-22," "TPPT-23," 'TPPT-24," "TPPT-25." , TPPT-26," "TPPT-27," 'TPPT-28," "TPPT-29," 
"TPPT-30," "TPPT-31," 'TPPT-32," "TPPT-33," 'TPPT-34," "TPPT-35," "TPPT-36." 'TPPT-37," 
"TPPT-38," "TPPT-39," ^TPPT^O/' "TPPT-4 1 "TPPT-42," and 'TPPT^3." In one aspect, the 
invention provides an isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1 - 
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43, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-43, c) a biologically active 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43, and d) an 
immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-43. In one alternative, the invention provides an isolated polypeptide comprising the amino 
acid sequence of SEQ ID NO: 1-43. 

The invention further provides an isolated polynucleotide encoding a polypeptide comprising 
" an amino acid sequence selected from the group consisting of a) an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1^3, b) a naturally occurring amino acid sequence having at 
least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-43, c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-43, and d) an immunogenic fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-43. In one alternative, the polynucleotide encodes a 
polypeptide selected from the group consisting of SEQ ID NO: 1-43. In another alternative, the 
15 polynucleotide is selected from the group consisting of SEQ ID NO:44-86. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-43, b) a naturally occurring amino acid sequence having at least 90% 
20 sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43, 
c) a biologically active fragment of an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-43, and d) an immunogenic fragment of an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-43. In one alternative, the invention provides a cell transformed 
with the recombinant polynucleotide. In another alternative, the invention provides a transgenic 
25 organism comprising the recombinant polynucleotide. 

The invention also provides a method for producing a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1 -43, b) a naturally occurring amino acid sequence having at least 90% 
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43, 
30 c) a biologically active fragment of an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-43, and d) an immunogenic fragment of an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-43. The method comprises a) culturing a cell under conditions 
suitable for expression of the polypeptide, wherein said cell is transformed with a recombinant 
polynucleotide comprising a promoter sequence operably linked to a polynucleotide encoding the 
35 polypeptide, and b) recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
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polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-43, b) a naturally occurring amino 
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-43, c) a biologically active fragment of an amino acid sequence 
5 selected from the group consisting of SEQ ID NO: 1-43, and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-43. 

The invention further provides an isolated polynucleotide comprising a polynucleotide 
sequence selected from the group consisting of a) a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:44-86, b) a naturally occurring polynucleotide sequence having at least 70% 
10 sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:44- 
86, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary 
to b), and e) an RNA equivalent of a)-d). In one alternative, the polynucleotide comprises at least 60 
contiguous nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a 
15 sample, said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide 
sequence selected from the group consisting of a) a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:44-86, b) a naturally occurring polynucleotide sequence having at least 70% 
sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:44- 
86, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary 

20 to b), and e) an RNA equivalent of a)-d). The method comprises a) hybridizing the sample with a 
probe comprising at least 20 contiguous nucleotides comprising a sequence complementary to said 
target polynucleotide in the sample, and which probe specifically hybridizes to said target 
polynucleotide, under conditions whereby a hybridization complex is formed between said probe and 
said target polynucleotide or fragments thereof, and b) detecting the presence or absence of said 

25 hybridization complex, and optionally, if present, the amount thereof. In one alternative, the probe 
comprises at least 60 contiguous nucleotides. 

The invention further provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide 
sequence selected from the group consisting of a) a polynucleotide sequence selected from the group 

30 consisting of SEQ ID NO:44-86, b) a naturally occurring polynucleotide sequence having at least 70% 
sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:44- 
86, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary 
to b), and e) an RNA equivalent of a)-d). The method comprises a) amplifying said target 
polynucleotide or fragment thereof using polymerase chain reaction amplification, and b) detecting 

35 the presence or absence of said amplified target polynucleotide or fragment thereof, and, optionally, if 
present, the amount thereof. 
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The invention further provides a pharmaceutical composition comprising an effective amount 
of a polypeptide comprising an amino acid sequence selected from the group consisting of a) an 
amino acid sequence selected from the group consisting of SEQ. ID NO: I -43, b) a naturally occurring 
amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-43, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-43, and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-43, and a pharmaceutically 
acceptable excipientr~In one embodiment, the pharmaceutical composition comprises an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-43. The invention additionally provides 
a method of treating a disease or condition associated with decreased expression of functional TPPT, 
comprising administering to a patient in need of such treatment the pharmaceutical composition. 

The invention also provides a method for screening a compound for effectiveness as an 
agonist of a polypeptide comprising an amino acid sequence selected from the group consisting of a) 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-43, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-43, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43. The 
method comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting 
agonist activity in the sample. In one alternative, the invention provides a pharmaceutical 
composition comprising an agonist compound identified by the method and a pharmaceutically 
acceptable excipient. In another alternative, the invention provides a method of treating a disease or 
condition associated with decreased expression of functional TPPT, comprising administering to a 
patient in need of such treatment the pharmaceutical composition. 

Additionally, the invention provides a method for screening a compound for effectiveness as 
an antagonist of a polypeptide comprising an amino acid sequence selected from the group consisting 
of a) an amino acid sequence selected from the group consisting of SEQ ED NO: 1-43, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-43, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-43, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43. The 
method comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting 
antagonist activity in the sample. In one alternative, the invention provides a pharmaceutical 
composition comprising an antagonist compound identified by the method and a pharmaceutically 
acceptable excipient. In another alternative, the invention provides a method of treating a disease or 
condition associated with overexpression of functional TPPT, comprising administering to a patient in 
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need of such treatment the pharmaceutical composition. 

The invention further provides a method of screening for a compound that specifically binds 
to a polypeptide comprising an amino acid sequence selected from the grouptconsisting.of a) an amino . 
acid sequence selected from the group consisting of SEQ ID NO: 1-43, b) a naturally occurring amino 
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-43, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-43, and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-43. The method comprises a) 
combining the polypeptide with at least one test compound under suitable conditions, and b) 
detecting binding of the. polypeptide to the test compound, thereby identifying a compound that 
specifically binds to the polypeptide. 

The invention further provides a method of screening for a compound that modulates the 
activity of a polypeptide comprising an amino acid sequence selected from the group consisting of a) 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-43, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-43, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1 -43. The 
method comprises a) combining the polypeptide with at least one test compound under conditions 
permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide in the 
presence of the test compound, and c) comparing the activity of the polypeptide in the presence of the 
test compound with the activity of the polypeptide in the absence of the test compound, wherein a 
change in the activity of the polypeptide in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NO:44-86, the method comprising a) 
exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered 
expression of the target polynucleotide. 

BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs), 
clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble full- 
length sequences encoding TPPT. 

Table 2 shows features of each polypeptide sequence, including potential motifs, homologous 
sequences, and methods, algorithms, and searchable databases used for analysis of TPPT. 
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Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific 
expression patterns of each nucleic acid sequence as determined by northern analysis; diseases, 
disorders, or conditions associated with these tissues; and the vector into which each cDNA was 
cloned. 

Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones 
encoding TPPT were isolated. 

Table 5 shows the tools, programs, and algorithms used to analyze the polynucleotides and 

_P^ypepjio^s^heJnvendon,along-with-applic^^ 

parameters. 



DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood 
that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which will 
be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," "an," 
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the an, and so 
forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. 
Although any machines, materials, and methods similar or equivalent to those described herein can be 
used to practice or test the present invention, the preferred machines, materials and methods are now 
described. All publications mentioned herein are cited for the purpose of describing and disclosing 
the cell lines, protocols, reagents and vectors which are reported in the publications and which might 
be used in connection with the invention. Nothing herein is to be construed as an admission that the 
invention is not entitled to antedate such disclosure by virtue of prior invention. 
30 DEFINITIONS 

'TPPT" refers to the amino acid sequences of substantially purified TPPT obtained from any 
species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
TPPT. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of TPPT either by directly interacting with 
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TPPT or by acting on components of the biological pathway in which TPPT participates. 

An "allelic variant" is an alternative form of the gene encoding TPPT. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times 
in a given sequence. 

"Altered" nucleic acid sequences encoding TPPT include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as TPPT or a 
polypeptide with at least one functional characteristic of TPPT. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe 
of the polynucleotide encoding TPPT, and improper or unexpected hybridization to allelic variants, 
with a locus other than the normal chromosomal locus for the polynucleotide sequence encoding 
TPPT. The encoded protein may also be "altered," and may contain deletions, insertions, or 
substitutions of amino acid residues which produce a silent change and result in a functionally 
equivalent TPPT. Deliberate amino acid substitutions may be made on the basis of similarity in 
polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the 
residues, as long as the biological or immunological activity of TPPT is retained. For example, 
negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged 
amino acids may include lysine and arginine. Amino acids with uncharged polar side chains having 
similar hydrophilicity values may include: asparagine and glutamine; and serine and threonine. 
Amino acids with uncharged side chains having similar hydrophilicity values may include: leucine, 
isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring 
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid 
sequence to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well 
known in the art. 

The term "antagonist'' refers to a molecule which inhibits or attenuates the biological activity 
of TPPT. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of TPPT either by 
directly interacting with TPPT or by acting on components of the biological pathway in which TPPT 
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participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments 
thereof, such as Fab, F(ab') 2 , and Fv fragments, which are capable of binding an epitopic determinant. 
Antibodies that bind TPPT polypeptides can be prepared using intact polypeptides or using fragments 
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used 
to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, 
or synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used 
camers^hat-arechernieal^ 



keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal 
10 The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 

makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (particular regions or three-dimensional structures 
on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
15 used to elicit the immune response) for binding to an antibody. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA; 
peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
20 sugar groups such as 2-methoxyethyl sugars or 2-methoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5-methyl cytosine, 2-deoxyuracil, or 7-deaza-2'-deox y guanos in e. Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 
nucleic acid sequence produced by the cell to form duplexes which block either transcription or 
25 translation. The designation "negative" or "minus" can refer to the antisense strand, and the 
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic TPPT, or of any oligopeptide thereof, 
30 to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

"Complementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5'-AGT-3* pairs with its complement, 
3'-TCA-5\ 

A "composition comprising a given polynucleotide sequence" and a "composition comprising 
a given amino acid sequence" refer broadly to any composition containing the given polynucleotide or 
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amino acid sequence. The composition may comprise a dry formulation or an aqueous solution. 
Compositions comprising polynucleotide sequences encoding TPPT or fragments of TPPT may be 
employed as hybridization probes. The probes may be stored in freeze-dried form and may be 
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; 
SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (PE Biosystems, 
Foster City CA) in the 5 f and/or the 3' direction, and resequenced, or which has been assembled from 
one or more overlapping cDNA, EST, or genomic DNA fragments using a computer program for 
fragment assembly, such as the GEL VIEW fragment assembly system (GCG, Madison WI) or Phrap 
(University of Washington, Seattle WA). Some sequences have been both extended and assembled to 
produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of the 
protein is conserved and not significantly changed by such substitutions. The table below shows 
amino acids which may be substituted for an original amino acid in a protein and which are regarded 
as conservative amino acid substitutions. 

Conservative Substitution 



20 


Ala 


Gly, Ser 




Arg 


His, Lys 




Asn 


Asp, Gin, His 




Asp 


Asn, Glu 




Cys 


Ala, Ser 


25 


Gin 


Asn, Glu, His 




Glu 


Asp, Gin, His 




Gly 


Ala 




His 


Asn, Arg, Gin, Glu 




He 


Leu, Val 


30 


Leu 


He, Val 




Lys 


Arg, Gin, Glu 




Met 


Leu, He 




Phe 


His, Met, Leu, Trp, Tyr 




Ser 


Cys, Thr 


35 


Thr 


Ser, Val 




Trp 


Phe, Tyr 




Tyr 


His, Phe, Trp 




Val 


He. Leu, Thr 



40 Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 

backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of 
the side chain. 
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A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide sequence can include, for example, replacement of 
hydrogen by an aikyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a 
polypeptide which retains at least one biological or immunological function of the natural molecule. 
A derivative polypeptide is one modified by glycosylation, pegylation, or any similar process that 
-retains-at-least-one-biological or immunology which it was 

derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

A "fragment" is a unique portion of TPPT or the polynucleotide encoding TPPT which is 
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 
to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a 
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 15, 
16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 
residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
20 from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain 
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the 
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 
embodiments. 

A fragment of SEQ ID NO:44-86 comprises a region of unique polynucleotide sequence that 
25 specifically identifies SEQ ID NO:44-86, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:44-86 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish 
SEQ ID NO:44-86 from related polynucleotide sequences. The precise length of a fragment of SEQ 
ID NO:44-86 and the region of SEQ ID NO:44-86 to which the fragment corresponds are routinely 
determinable by one of ordinary skill in the art based on the intended purpose for the fragment. 

A fragment of SEQ ID NO: 1-43 is encoded by a fragment of SEQ ID NO:44-86. A fragment 
of SEQ ID NO: 1-43 comprises a region of unique amino acid sequence that specifically identifies 
SEQ ID NO: 1-43. For example, a fragment of SEQ ID NO: 1-43 is useful as an immunogenic peptide 
for the development of antibodies that specifically recognize SEQ ID NO: 1-43. The precise length of 
a fragment of SEQ ID NO: 1-43 and the region of SEQ ID NO: 1-43 to which the fragment 
corresponds are routinely determinable by one of ordinary skill in the art based on the intended 

19 
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purpose for the fragment. 

A "full-length" polynucleotide sequence is one containing at least a translation initiation 
codon (e.g., methionine) followed by an open reading frame and a translation termination codon. A 
"full-length" polynucleotide sequence encodes a "full-length" polypeptide sequence. 
5 "Homology" refers to sequence similarity or, interchangeably, sequence identity, between two 

or more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps 

10 in thf* <sP.miP.nf f=»<i hp.incr rnmn^rpH in nrnVr tr\ nntimi7A allrmmpnt KptiuApn t\*rr\ coonon^oo nnrl 

therefore achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program. This program is part of the LASERGENE software package, a suite of 

15 molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in 

Higgins, D.G. and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 
8: 189-191 . For pairwise alignments of polynucleotide sequences, the default parameters are set as 
follows: Ktuple=2, gap penalty=5 ? window=4, and "diagonals saved"=4. The "weighted" residue 
weight table is selected as the default. Percent identity is reported by CLUSTAL V as the "percent 

20 similarity" between aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms 
is provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment 
Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from 
several sources, including the NCBI, Bethesda, MD, and on the Internet at 

25 http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to align a known polynucleotide sequence with other 
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. The 

30 "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 

programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (Aprii-2 1-2000) set at default parameters. Such default parameters may be, for example: 
Matrix: BLOSUM62 

35 Reward for match: I 

Penalty for mismatch: -2 

20 
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Open Gap: 5 and Extension Gap: 2 penalties 
Gap x drop-off: 50 
Expect: 10 
Word Size: 11 
5 Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 

over the length^^jragment_taken.from a-larger,-defined-sequence, for-instance,-a-fragment-of-at— — 

least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous 
10 nucleotides. Such lengths are exemplary only, and it is understood that any fragment length supported 
by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to describe a 
length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes 
15 in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
sequences that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 
20 alignment methods take into account conservative amino acid substitutions. Such conservative 

substitutions, explained in more detail above, generally preserve the charge and hydrophobicity at the 
site of substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
25 sequence alignment program (described and referenced above). For pairwise alignments of 

polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap 
penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default 
residue weight table. As with polynucleotide alignments, the percent identity is reported by 
CLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs. 
30 Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 

comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0.12 (Apr-21-2000) with blastp set at default parameters. Such default parameters may be, for 
example: 

Matrix: BLOSUM62 
35 Open Gap: 11 and Extension Gap: I penalties 

Gap x drop-off: 50 
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Expect: 10 
Word Size: 3 

Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 
150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 
length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 
used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary* strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the 
stringency of the hybridization process, with more stringent conditions allowing less non-specific 
binding, i.e., binding between pairs of nucleic acid strands that are not perfectly matched. Permissive 
conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skill 
in the art and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency, and therefore hybridization specificity. 
Permissive annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 
1% (w/v) SDS, and about 100 Mg/ml sheared, denatured salmon sperm DNA. 

Generally, stringency of hybridization is expressed, in pan, with reference to the temperature 
under which the wash step is carried out. Such wash temperatures are typically selected to be about 
5°C to 20°C lower than the thermal melting point (T m ) for the specific sequence at a defined ionic 
strength and pH. The T m is the temperature (under defined ionic strength and pH) at which 50% of 
the target sequence hybridizes to a perfectly matched probe. An equation for calculating T m and 
conditions for nucleic acid hybridization are well known and can be found in Sambrook, J. et al., 
1989, Molecular Cloning: A Laboratory Manual . 2 nd ed., vol. 1-3, Cold Spring Harbor Press, 
Plainview NY; specifically see volume 2. chapter 9. 
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High stringency conditions for hybridization between polynucleotides of the present invention 
include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, for 1 hour. 
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may 
be varied from about 0. 1 to 2 x SSC, with SDS being present at about 0. 1 %. Typically, blocking 
5 reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, 
sheared and denatured salmon sperm DNA at about 100-200 ug/ml. Organic solvent, such as 
formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, 

such.as-for-RNA:DNA-hybridizations.~UsefuI-variations on-these -wash conditions-will be-readily 

apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency 
10 conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 
strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex" refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g., C 0 t or Rot analysis) or formed between one 
15 nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid 
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate 
to which cells or their nucleic acids have been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 
20 "Immune response" can refer to conditions associated with inflammation, trauma, immune 

disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
cellular and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of TPFT which is 
25 capable of eliciting an immune response when introduced into a living organism, for example, a 

mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment 
of TPPT which is useful in any of the antibody production methods disclosed herein or known in the 
art. 

The term "microarray" refers to an arrangement of a plurality of polynucleotides, 
30 polypeptides, or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 

The term "modulate" refers to a change in the activity of TPPT. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other 
35 biological, functional, or immunological properties of TPPT. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
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polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-Iike material. 

"Operably linked" refers to the situation in which a First nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, 
and may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an TPPT may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in 
the art. These processes may occur synthetically or biochemically. Biochemical modifications will 
vary by cell type depending on the enzymatic milieu of TPPT. 

"Probe" refers to nucleic acid sequences encoding TPPT, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 
Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. 
"Primers" are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target 
polynucleotide by complementary base-pairing. The primer may then be extended along the target 
DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and 
identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in the references, for 
example Sambrook, J. et at., 1989, Molecular Cloning: A Laboratory Manual , 2 nd ed., vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel, F.M. et al.,1987, Current Protocols in Molecular 
Biology . Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al., 1990, PCR 
Protocols, A Guide to Methods and Applications , Academic Press, San Diego CA. PCR primer pairs 
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can be derived from a known sequence, for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge 
MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
5 purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 
5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer 

selection programs have-ineoiporated-additional-features for expanded capabilitiesr-For examplerthe~- 

PrimOU primer selection program (available to the public from the Genome Center at University of 
10 Texas South West Medical Center, Dallas TX) is capable of choosing specific primers from megabase 
sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 primer 
selection program (available to the public from the Whitehead Institute/MIT Center for Genome 
Research, Cambridge MA) allows the user to input a "mispriming library," in which sequences to 
avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for the selection of 
15 oligonucleotides for microarrays. (The source code for the latter two primer selection programs may 
also be obtained from their respective sources and modified to meet the user's specific needs.) The 
PrimeGen program (available to the public from the UK Human Genome Mapping Project Resource 
Centre, Cambridge UK) designs primers based on multiple sequence alignments, thereby allowing 
selection of primers that hybridize to either the most conserved or least conserved regions of aligned 
20 nucleic acid sequences. Hence, this program is useful for identification of both unique and conserved 
oligonucleotides and polynucleotide fragments. The oligonucleotides and polynucleotide fragments 
identified by any of the above selection methods are useful in hybridization technologies, for example, 
as PCR or sequencing primers, microarray elements, or specific probes to identify fully or partially 
complementary polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection 
25 are not limited to those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
30 such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter 
sequence. Such a recombinant nucleic acid may be part of a vector that is used, for example, to 
transform a ceil. 

35 Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 

vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 

25 
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expressed, inducing a protective immunological response in the mammal. 

A "regulatory element** refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5 J and 3* untranslated regions ~~ 
(UTRs). Regulatory elements interact with host or viral proteins which control transcription, 
5 translation, or RNA stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art. 

An "RNA equivalent,'* in reference to a DNA sequence, is composed of the same linear 
sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 

The term "sample** is used in its broadest sense. A sample suspected of containing nucleic 
15 acids encoding TPFT, or fragments thereof, or TPPT itself, may comprise a bodily fluid; an extract 
from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or 
cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding'' refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
20 synthetic binding composition. The interaction is dependent upon the presence of a particular 
structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding 
molecule. For example, if an antibody is specific for epitope "A," the presence of a polypeptide 
comprising the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A 
and the antibody will reduce the amount of labeled A that binds to the antibody. 
25 The term "substantially purified" refers to nucleic acid or amino acid sequences that are 

removed from their natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from other components with which 
they are naturally associated. 

A "substitution** refers to the replacement of one or more amino acid residues or nucleotides 
30 by different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticies and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 
35 A "transcript image" refers to the collective pattern of gene expression by a particular cell 

type or tissue under given conditions at a given time. 
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'Transformation" describes a process by which exogenous DNA is introduced into a recipient 
cell. Transformation may occur under natural or artificial conditions according to various methods 
well known in the art, and may rely on any known method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected based 
5 on the type of host cell being transformed and may include, but is not limited to, bacteriophage or 
viral infection, electroporation, heat shock, lipofection, and particle bombardment. The term 
"transformed" cells includes stably transformed cells in which the inserted DNA is capable of 

replication-either-as-an-auto^^ 

as transiently transformed cells which express the inserted DNA or RNA for limited periods of time. 
10 A "transgenic organism," as used herein, is any organism, including but not limited to animals 

and plants, in which one or more of the cells of the organism contains heterologous nucleic acid 
introduced by way of human intervention, such as by transgenic techniques well known in the art. 
The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of 
the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with a 
15 recombinant virus. The term genetic manipulation does not include classical cross-breeding, or m 
vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms contemplated in accordance with the present invention include bacteria, 
cyanobacteria, fungi, plants, and animals. The isolated DNA of the present invention can be 
introduced into the host by methods known in the art, for example infection, transfection, 
20 transformation or transconjugation. Techniques for transferring the DNA of the present invention 
into such organisms are widely known and provided in references such as Sambrook et al. (1989), 
supra . 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of 

25 the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or 
greater sequence identity over a certain defined length. A variant may be described as, for example, 
an "allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice variant may 

30 have significant identity to a reference molecule, but will generally have a greater or lesser number of 
polynucleotides due to alternative splicing of exons during mRNA processing. The corresponding 
polypeptide may possess additional functional domains or lack domains that are present in the 
reference molecule. Species variants are polynucleotide sequences that vary from one species to 
another. The resulting polypeptides generally will have significant amino acid identity relative to 

35 each other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene 
between individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
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polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The 
presence of SNPs may be indicative of, for example, a certain population, a disease state, or a 
propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 
5 at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence 
identity over a certain defined length of one of the polypeptides. 

10 THE INVENTION 

The invention is based on the discovery of new human transport proteins (TPPT), the 
polynucleotides encoding TPPT, and the use of these compositions for the diagnosis, treatment, or 
prevention of transport, metabolic, neurological, cardiovascular, reproductive, and immune disorders, 
and cell proliferative disorders including cancer. 

15 Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding 

TPPT. Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the polypeptide 
and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte clones in which 
nucleic acids encoding each TPfi were identified, and column 4- shows the cDNA libraries from 
which these clones were isolated. Column 5 shows Incyte clones and their corresponding cDNA 

20 libraries. Clones for which cDNA libraries are not indicated were derived from pooled cDNA 
libraries. In some cases, GenBank sequence identifiers are also shown in column 5. The Incyte 
clones and GenBank cDNA sequences, where indicated, in column 5 were used to assemble the 
consensus nucleotide sequence of each TPPT and are useful as fragments in hybridization 
technologies. 

25 The columns of Table 2 show various properties of each of the polypeptides of the invention: 

column 1 references the SEQ ED NO; column 2 shows the number of amino acid residues in each 
polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation 
sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 
shows homologous sequences as identified by BLAST analysis; and column 7 shows analytical 

30 methods and in some cases, searchable databases to which the analytical methods were applied. The 
methods of column 7 were used to characterize each polypeptide through sequence homology and 
protein motifs. 

The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 
associated with nucleotide sequences encoding TPPT. The first column of Table 3 lists the nucleotide 
35 SEQ ID NOs. Column 2 lists fragments of the nucleotide sequences of column 1. These fragments 
are useful, for example, in hybridization or amplification technologies to identify SEQ ID NO:44-86 
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and to distinguish between SEQ ID NO:44-86 and related polynucleotide sequences. The 
polypeptides encoded by these fragments are useful, for example, as immunogenic peptides. Column 
3 lists tissue categories which express TPPT as a fraction of total tissues expressing TPPT. Column 4 
lists diseases, disorders, or conditions associated with those tissues expressing TPPT as a fraction of 
5 total tissues expressing TPPT. Column 5 lists the vectors used to subclone each cDNA library. 

Of particular interest is the expression of SEQ ID NO:50 exclusively in cardiovascular tissue, 
the expression of SEQ ID NO:56 in nervous and gastrointestinal tissues, the expression of SEQ ID 
NO:57-in-gastrointestinal tissues, andlhe express icovof SEQ'ID NO: 66 in nervous system tissues. Of 
particular note is the tissue-specific expression of SEQ ID NO:75. Over 71% of the cDNA libraries 
10 expressing SEQ ID NO:75 are derived from lung tissue. 

The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries 
from which cDNA clones encoding TPPT were isolated. Column 1 references the nucleotide SEQ ID 
NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 shows 
the tissue origins and other descriptive information relevant to the cDNA libraries in column 2. 
15 SEQ ID NO:44 maps to chromosome 7 within the interval from 38.80 to 42. 10 centiMorgans. 

SEQ ID NO:48 maps to chromosome X within the interval from 107.90 to 122.80 centiMorgans. 
SEQ ID NO:60 maps to chromosome 2 within the interval from 157.0 to 167.0 centiMorgans. SEQ 
ID NO:65 maps to chromosome 2 within the interval from 17.4 to 40.7 centiMorgans and to 
chromosome 5 within the interval from 61.1 to 69.6 centiMorgans. The interval on chromosome 5 
20 from 61.1 to 69.6 centiMorgans also contains genes associated with Cockayne syndrome. SEQ ID 
NO:69 maps to chromosome 3 within the interval from 157.40 to 162.00 centiMorgans. SEQ ID 
NO:70 maps to chromosome 3 within the interval from 176.40 to 179.80 centiMorgans. SEQ ID 
NO:71 maps to chromosome 18 within the interval from the p-terminus to 52.30 centiMorgans. SEQ 
ID NO:73 maps to chromosome 17 within the interval from 75.70 to 84.20 centiMorgans, and to 
25 chromosome 2 within the interval from 204.70 to 209.30 centiMorgans. SEQ ID NO:76 maps to 
chromosome 20 within the interval from 79.00 to 94.40 centiMorgans. SEQ ID NO:80 maps to 
chromosome 1 8 within the interval from 1 .60 to 6.20 centiMorgans, and to chromosome 1 1 within the 
interval from 1 17.90 to 126.00 centiMorgans. SEQ ID NO:83 maps to chromosome 17 within the 
interval from 67.60 to 69.30 centiMorgans, and from 83.8 centiMorgans to the q-terminus, and to 
30 chromosome 7 within the interval from 105.20 to 1 14.50 centiMorgans. 

The invention also encompasses TPPT variants. A preferred TPPT variant is one which has 
at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the TPPT amino acid sequence, and which contains at least one functional or 
structural characteristic of TPPT. 
35 The invention also encompasses polynucleotides which encode TPPT. In a particular 

embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
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from the group consisting of SEQ ID NO:44-86, which encodes TPPT. The polynucleotide sequences 
of SEQ ID NO:44-86, as presented in the Sequence Listing, embrace the equivalent RNA sequences, 
wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone 
is composed of ribose instead of deoxyribose. 
5 The invention also encompasses a variant of a polynucleotide sequence encoding TPPT. In 

particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at 
least about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide 
sequence encoding TPPT. A particular aspect of the invention encompasses a variant of a 
polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID NO:44- 
10 86 which has at least about 70%, or alternatively at least about 85%, or even at least about 95% 
polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting of 
SEQ ID NO:44-86. Any one of the polynucleotide variants described above can encode an amino acid 
sequence which contains at least one functional or structural characteristic of TPPT. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
15 genetic code, a multitude of polynucleotide sequences encoding TPPT, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
20 polynucleotide sequence of naturally occurring TPPT, and all such variations are to be considered as 
being specifically disclosed. 

Although nucleotide sequences which encode TPPT and its variants are generally capable of 
hybridizing to the nucleotide sequence of the naturally occurring TPPT under appropriately selected 
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding TPPT or 
25 its derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally 
occurring codons, Codons may be selected to increase the rate at which expression of the peptide 
occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with which 
particular codons are utilized by the host. Other reasons for substantially altering the nucleotide 
sequence encoding TPPT and its derivatives without altering the encoded amino acid sequences 
30 include the production of RNA transcripts having more desirable properties, such as a greater 
half-hfe, than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode TPPT and 
TPPT derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell systems 
35 using reagents well known in the art. Moreover, synthetic chemistry may be used to introduce 
mutations into a sequence encoding TPPT or any fragment thereof. 
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Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:44-86 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 
5 152:507-5 1 1 .) Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of 

the embewdiments-of-the inventionT-The^^riods may employ such enzymes as the Klenow fragment 

of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (PE 
10 Biosystems, Foster City CA), thermostable T7 polymerase (Amersham Pharmacia Biotech, 

Piscataway NJ), or combinations of polymerases and proofreading exonucleases such as those found 
in the ELONGASE amplification system (Life Technologies, Gaithersburg MD). Preferably, 
sequence preparation is automated with machines such as the MICROLAB 2200 liquid transfer 
system (Hamilton, Reno NV), PTC200 thermal cycler (MJ Research, Watertown MA) and ABI 
15 CATALYST 800 thermal cycler (PE Biosystems). Sequencing is then carried out using either the 
ABI 373 or 377 DNA sequencing system (PE Biosystems), the MEGABACE 1000 DNA sequencing 
system (Molecular Dynamics, Sunnyvale CA), or other systems known in the art. The resulting 
sequences are analyzed using a variety of algorithms which are well known in the art. (See, e.g., 
Ausubel, F.M. (1997) Short Proto cols in Molecular Biology . John Wiley & Sons, New York NY, unit 
20 7.7; Meyers, R.A. (1995) Molecular Biology and Biotechnology . Wiley VCH, New York NY, pp. 
856-853.) 

The nucleic acid sequences encoding TPPT may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences, 
such as promoters and regulatory elements. For example, one method which may be employed, 

25 restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) 
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising 
a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids 

30 Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent 
to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. 
(1991) PCR Methods Appiic. 1:111-119.) In this method, multiple restriction enzyme digestions and 
ligations may be used to insert an engineered double-stranded sequence into a region of unknown 
sequence before performing PCR. Other methods which may be used to retrieve unknown sequences 

35 are known in the art. (See, e.g., Parker, J.D. et al. ( 1991 ) Nucleic Acids Res. 19:3055-3060). 

Additionally, one may use PCR, nested primers, and PROMOTERFTNDER libraries (Clontech, Palo 
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Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in 
finding intron/exon junctions. For all PCR-based methods, primers may be designed using 
commercially available software, such as OLIGO 4.06 Primer Analysis software (National 
Biosciences, Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in 
5 length, to have a GC content of about 50% or more, and to anneal to the template at temperatures of 
about 68°C to 72°C. 

When screening for full-length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5* regions of genes, are preferable for situations in which an oligo d(T) 
10 library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence 
into 5* non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze 
the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
15 specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, PE Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
20 which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof which 
encode TPPT may be cloned in recombinant DNA molecules that direct expression of TPPT, or 
fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy 
of the genetic code, other DNA sequences which encode substantially the same or a functionally 
25 equivalent amino acid sequence may be produced and used to express TPPT. 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alter TPPT-encoding sequences for a variety of purposes including, but 
not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 
30 oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
mediated site-directed mutagenesis may be used to introduce mutations that create new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
' as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
35 5,837,458; Chang v C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat. 
Biotechnol- 17:259-264; and Crameri. A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or 
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improve the biological properties of TPPT, such as its biological or enzymatic activity or its ability to 
bind to other molecules or compounds. DNA shuffling is a process by. which a library of gene 
variants is produced using PCR-mediated recombination of gene fragments. The library is then 
subjected to selection or screening procedures that identify those gene variants with the desired 
properties. These preferred variants may then be pooled and further subjected to recursive rounds of 
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 
breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
-point-mutationsnmay b^fecc^ 



optimized. Alternatively, fragments of a given gene may be recombined with fragments of 
homologous genes in the same gene family, either from the same or different species, thereby 
maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 
manner. 

In another embodiment, sequences encoding TPPT may be synthesized, in whole or in part, 
using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucleic Acids 
Symp. Sen 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively, 
TPPT itself or a fragment thereof may be synthesized using chemical methods. For example, peptide 
synthesis can be performed using various solution-phase or solid-phase techniques. (See, e.g., 
Creighton, T. (1984) Proteins. St ructures and Molecular Properties . WH Freeman, New York NY, pp. 
55-60; and Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved 
using the ABI 431 A peptide synthesizer (PE Biosystems). Additionally, the amino acid sequence of 
TPPT, or any part thereof, may be altered during direct synthesis and/or combined with sequences 
from other proteins, or any part thereof, to produce a variant polypeptide or a polypeptide having a 
sequence of a naturally occurring polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing. 
(See, e.g., Creighton, supra , pp. 28-53.) 

In order to express a biologically active TPPT, the nucleotide sequences encoding TPPT or 
derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 
the necessary elements for transcriptional and translation^ control of the inserted coding sequence in 
a suitable host. These elements include regulatory sequences, such as enhancers, constitutive and 
inducible promoters, and 5' and 3 * untranslated regions in the vector and in polynucleotide sequences 
encoding TPPT. Such elements may vary in their strength and specificity. Specific initiation signals 
may also be used to achieve more efficient translation of sequences encoding TPPT. Such signals 
35 include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where 

sequences encoding TPPT and its initiation codon and upstream regulatory sequences are inserted into 
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the appropriate expression vector, no additional transcriptional or translational control signals may be 
needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, exogenous 

translational control signals including an in-frame ATG initiation codon should be provided by the - - * 

vector. Exogenous translational elements and initiation codons may be of various origins, both 
5 natural and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers 

appropriate for the particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. 

Cell Differ. 20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct expression 

vectors containing sequences encoding TPPT and appropriate transcriptional and translational control 
10 elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in 

vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory 

Manual , Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et al. (1995) 

Current Protocols in Molecular Biology . John Wiley & Sons, New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 
15 encoding TPPT. These include, but are not limited to, microorganisms such as bacteria transformed 

with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 

yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); 

plant eel! systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 

tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
20 animal cell systems. (See, e.g., Sambrook, supra ; Ausubel, supra ; Van Heeke, G. and S.M. Schuster 

(1989) J. Biol. Chem. 264:5503-5509; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; 

Scorer, C.A. et al. (1994) Bio/Technology 12:181-184; Engelhard, E.K. et al. (1994) Proc. Natl. Acad. 

Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Then 7:1937-1945; Takamatsu, N. 

(1987) EMBO J. 6:307-311; Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) 
25 Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105; The McGraw Hill 

Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 191-196; Logan, J. 

and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and Harrington, J.J. et al. (1997) Nat. 

Genet. 15:345-355.) Expression vectors derived from retroviruses, adenoviruses, or herpes or 

vaccinia viruses, or from various bacterial plasmids, may be used for delivery of nucleotide sequences 
30 to the targeted organ, tissue, or cell population. (See, e.g., Di Nicola, M. et al. (1998) Cancer Gen. 

Ther. 5(6):350-356; Yu, M. et al., (1993) Proc. Natl. Acad. Sci. USA 90(13):6340-6344; Bulier, R.M. 

et al. (1985) Nature 317(6040):813-815; McGregor, D.P. et al. (1994) Mol. Immunol. 3 1(3):219-226; 

and Verma, I.M. and N. Somia (1997) Nature 389:239-242.) The invention is not limited by the host 

cell employed. 

35 In bacterial systems, a number of cloning and expression vectors may be selected depending 

upon the use intended for polynucleotide sequences encoding TPPT. For example, routine cloning, 
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subcloning, and propagation of polynucleotide sequences encoding TPPT can be achieved using a 
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 
plasmid (Life Technologies). Ligation of sequences encoding TPPT into the vector's multiple cloning 
site disrupts the lacL gene, allowing a colorimetric screening procedure for identification of 
transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for 
in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of 
nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. and SM. Schuster (1989) J. Biol. 

Chem..264:5503r5509.)_\\aien-large.quantities.of_TPCT_are_neede 

antibodies, vectors which direct high level expression of TPPT may be used. For example, vectors 
containing the strong, inducible T5 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of TPPT. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomvces cerevisiae or Pichia pastoris . In addition, such 
vectors direct either the secretion or intracellular retention of expressed proteins and enable 
integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 
1995, supra : Bitter, supra : and Scorer, supra .) 

Plant systems may also be used for expression of TPPT. Transcription of sequences encoding 
TPPT may be driven viral promoters, e.g., the 35S and 19S promoters of CaMV used alone or in 
combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 6:307-31 1). 
Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be 
used. (See, e.g., Coruzzi, supra : Broglie, supra : and Winter, supra .) These constructs can be 
introduced into plant cells by direct DNA transformation or pathogen-mediated transfection. (See, 
e *g* The McGr aw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, 
pp. 191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding TPPT may be ligated into an 
adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain 
infective virus which expresses TPPT in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV- 
based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 
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15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable expression of 
TPPT in cell lines is preferred. For example, sequences encoding TPPT can be transformed into cell 
lines using expression vectors which may contain viral origins of replication and/or endogenous 
5 expression elements and a selectable marker gene on the same or on a separate vector. Following the 
introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media 
before being switched to selective media. The purpose of the selectable marker is to confer resistance 
to a selective agent, and its presence allows growth and recovery of cells which successfully express 
the introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 

10 culture techniques appropriate to the ceil type. 

Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in tk and apr cells, respectively. (See, e.g., Wigler, M. et al. 
(1977) Cell 1 1:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or 

15 herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance to 
methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and pat 
confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., 

* ' > \ * s x 1 ww. i^o-ii. /\cau. oci. vjJA / / .jjO / / \j, ^OiuciC-vjfU Spin, r. ei 3.1. y 1 i ) 

J. Mol. Biol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisD, which 

20 alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. 
Natl. Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins 
(GFP; Clontech), 6 glucuronidase and its substrate B-glucuronide, or luciferase and its substrate 
luciferin may be used. These markers can be used not only to identify transformants, but also to 
quantify the amount of transient or stable protein expression attributable to a specific vector system. 

25 (See, e.g., Rhodes, C.A. (1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest is 
also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding TPPT is inserted within a marker gene sequence, transformed cells containing 
sequences encoding TPPT can be identified by the absence of marker gene function. Alternatively, a 

30 marker gene can be placed in tandem with a sequence encoding TPPT under the control of a single 
promoter. Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding TPPT and that express 
TPPT may be identified by a variety of procedures known to those of skill in the art. These 

35 procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 

amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 
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chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring the expression of TPPT using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 
5 fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes on TPPT is preferred, but a 
competitive binding assay may be employed. These and other assays are well known in the art. (See, 
e.g., Hafnpton7R. et airflWO TSerological Methods, a Laboratory Manual . APS Press. St Pan? MM 
Sect. IV ; Coligan, J.E. et al. (1997) Current Protocols in Immunology . Greene Pub. Associates and 
10 Wiley-Intersciehce, New York NY; and Pound, J.D. (1998) Immunochemical Protocols ; Humana 
Press, Totowa NJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 
may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding TPPT 

15 include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
Alternatively, the sequences encoding TPPT, or any fragments thereof, may be cloned into a vector 
for the production of an mRNA probe. Such vectors are known in the art, are commercially available, 
and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase 
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety 

20 of commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega 
(Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for 
ease of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic 
agents, as well as substrates,, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding TPPT may be cultured under 

25 conditions suitable for the expression and recovery of the protein from cell culture. The protein 

produced by a transformed cell may be secreted or retained intracellularly depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors 
containing polynucleotides which encode TPPT may be designed to contain signal sequences which 
direct secretion of TPPT through a prokaryotic or eukaryotic cell membrane. 

30 111 addition, a host cell strain may be chosen for its ability to modulate expression of the 

inserted sequences or to process the expressed protein in the desired fashion. Such modifications of 
the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, 
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" or 
"pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. 

35 Different host cells which have specific cellular machinery and characteristic mechanisms for 

post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the 
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American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct 
modification and processing of the foreign protein. 

...... In another embodiment of the inventions-natural, modified, or recombinant- nucleic aeki 

sequences encoding TPPT may be ligated to a heterologous sequence resulting in translation of a 
fusion protein in any of the aforementioned host systems. For example, a chimeric TPPT protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for inhibitors of TPPT activity. Heterologous protein and 
peptide moieties may also facilitate purification of fusion proteins using commercially available 
affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), 
maltose binding proiein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, 
c-myc 9 and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate 
fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal- 
chelate resins, respectively. FLAG, c-myc y and hemagglutinin (HA) enable immunoaffinity 
purification of fusion proteins using commercially available monoclonal and polyclonal antibodies 
that specifically recognize these epitope tags. A fusion protein may also be engineered to contain a 
proteolytic cleavage site located between the TPPT encoding sequence and the heterologous protein 
sequence, so that TPPT may be cleaved away from the heterologous moiety following purification. 
Methods for fusion protein expression and purification are discussed in Ausubei (1995, supra , ch. 10). 
A variety of commercially available kits may also be used to facilitate expression and purification of 
fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled TPPT may be achieved in 
vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These 
systems couple transcription and translation of protein-coding sequences operably associated with the 
T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid 
precursor, for example, 35 S-methionine. 

TPPT of the present invention or fragments thereof may be used to screen for compounds that 
specifically bind to TPPT. At least one and up to a plurality of test compounds may be screened for 
specific binding to TPPT. Examples of test compounds include antibodies, oligonucleotides, proteins 
(e.g., receptors), or small molecules. 

In one embodiment, the compound thus identified is closely related to the natural ligand of 
TPPT, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
natural binding partner. (See, Coligan, J.E. et al. (1991) Current Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which TPPT 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the 
compound can be rationally designed using known techniques. In one embodiment, screening for 
these compounds involves producing appropriate cells which express TPPT, either as a secreted 
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protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Prosophila . or E. 
coli. Cells expressing TPPT or cell membrane fractions which contain TPPT are then contacted with 
a test compound and binding, stimulation, or inhibition of activity of either TPPT or thtrcompound is 
analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, 
the assay may comprise the steps of combining at least one test compound with TPPT, either in 
-solution or affixed'to^asolid support, and"detectingThe binding of TPPT to the compound. 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 
labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical 
libraries, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a 
solid support. 

TPPT of the present invention or fragments thereof may be used to screen for compounds that 
modulate the activity of TPPT. Such compounds may include agonists, antagonists, or partial or 
inverse agonists. In one embodiment, an assay is performed under conditions permissive for TPPT 
activity, wherein TPPT is combined with at least one test compound, and the activity of TPPT in the 
presence of a test compound is compared with the activity of TPPT in the absence of the test 
compound. A change in the activity of TPPT in the presence of the test compound is indicative of a 
compound that modulates the activity of TPPT. Alternatively, a test compound is combined with an 
invitro or cell-free system comprising TPPT under conditions suitable for TPPT activity, and the 
assay is performed. In either of these assays, a test compound which modulates the activity of TPPT 
may do so indirectly and need not come in direct contact with the test compound. At least one and up 
to a plurality of test compounds may be screened. 

In another embodiment, polynucleotides encoding TPPT or their mammalian homologs may 
be "knocked out" in an animal model system using homologous recombination in embryonic stem 
(ES) cells. Such techniques are well known in the art and are useful for the generation of animal 
models of human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337.) For 
example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse 
embryo and grown in culture. The ES cells are transformed with a vector containing the gene of 
interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M.R. 
( 1 989) Science 244: 1 288- 1 292). The vector integrates into the corresponding region of the host 
genome by homologous recombination. Alternatively, homologous recombination takes place using 
the Cre-loxP system to knockout a gene of interest in a tissue- or developmental stage-specific 
manner (Marth, J.D. (1996) Clin. Invest. 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids 
35 Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell 

blastocysts such as those from the C57BL/6 mouse strain. The blastocysts are surgically transferred 
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to pseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce 
heterozygous or homozygous strains. Transgenic animals thus generated may be tested with potential 
therapeutic or toxic agents. 

Polynucleotides encoding TPPT may also be manipulated in vitro in ES cells derived from 
5 human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. 
(1998) Science 282:1 145-1 147). 

Polynucleotides encoding TPPT can also be used to create "knockin" humanized animals 

10 (pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a 

region of a polynucleotide encoding TPPT is injected into animal ES cells, and the injected sequence 
integrates into the animal cell genome. Transformed cells are injected into blastulae, and the 
blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and 
treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 

15 Alternatively, a mammal inbred to overexpress TPPT, e.g., by secreting TPPT in its milk, may also 
serve as a convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74). 
THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of TPPT and human transport proteins. In addition, the expression of TPPT is 

20 closely associated with neurological, cardiovascular, reproductive, gastrointestinal, and 

hematopoietic/immune tissues, and inflammation, cell proliferation, and cancer. Therefore, TPPT 
-appears to play a role in transport, metabolic, neurological, cardiovascular, reproductive, and immune 
disorders, and cell proliferative disorders including cancer. In the treatment of disorders associated 
with increased TPPT expression or activity, it is desirable to decrease the expression or activity of 

25 TPPT. In the treatment of disorders associated with decreased TPPT expression or activity, it is 
desirable to increase the expression or activity of TPPT. 

Therefore, in one embodiment, TPPT or a fragment or derivative thereof may be administered 
to a subject to treat or prevent a disorder associated with decreased expression or activity of TPPT. 
Examples of such disorders include, but are not limited to, a transport disorder such as akinesia, 

30 amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, Becker's muscular dystrophy, 

Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes insipidus, diabetic neuropathy, 
Duchenne muscular dystrophy, hyperkalemic periodic paralysis, normokalemic periodic paralysis, 
Parkinson's disease, malignant hyperthermia, multidrug resistance, myasthenia gravis, myotonic 
dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral neuropathy, cerebral neoplasms, 

35 prostate cancer; cardiac disorders associated with transport, e.g., angina, bradyarrythmia, 
tachyarrythmia, hypertension. Long QT syndrome, myocarditis, cardiomyopathy, nemaline 
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myopathy, centronuclear myopathy, lipid myopathy, mitochondrial myopathy, thyrotoxic myopathy, 
ethanol myopathy, dermatomyositis, inclusion body myositis, infectious myositis, polymyositis; 
neurological disorders associated with transport, e.g., Alzheimer's disease, amnesia, bipolar disorder, 
dementia, depression, epilepsy, Tourette's disorder, paranoid psychoses, and schizophrenia; and other 
disorders associated with transport, e.g., neurofibromatosis, postherpetic neuralgia, trigeminal 
neuropathy, sarcoidosis, sickle cell anemia, Wilson's disease, cataracts, infertility, pulmonary artery 
stenosis, sensorineural autosomal deafness, hyperglycemia, hypoglycemia, Grave's disease, goiter, 
-Gushing-sdiseaserAddison-s-diseaserglucose-galactose-malabsorption-syndromei- 



hypercholesterolemia, adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn 
10 syndrome, von Gierke disease, cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease; a 
metabolic disorder such as Addison's disease, cerebrotendinous xanthomatosis, congenital adrenal 
hyperplasia, coumarin resistance, cystic fibrosis, diabetes, fatty hepatocirrhosis, 

fructose- 1,6-diphosphatase deficiency, galactosemia, goiter, glucagonoma, glycogen storage diseases, 
hereditary fructose intolerance, hyperad renal ism, hypoadrenalism, hyperparathyroidism, 
15 hypoparathyroidism, hypercholesterolemia, hyperthyroidism, hypoglycemia, hypothyroidism, 
hyperlipidemia, hyperlipemia, lipid myopathies, lipodystrophies, lysosomal storage diseases, 
mannosidosis, neuraminidase deficiency, obesity, pentosuria phenylketonuria, and pseudovitamin D- 
deficiency rickets; a neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, 
cerebral neoplasms, Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's 

20 disease and other extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron 
disorders, progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple 
sclerosis and other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural 
empyema, epidural abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral 
central nervous system disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and 

25 Gerstmann-Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases 
of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial 
nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, 

30 peripheral nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, 

endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including 
mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), akathesia, amnesia, 
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, and familial 

35 frontotemporai dementia; a cardiovascular disorder such as arteriovenous fistula, atherosclerosis, 
hypertension, vasculitis, Raynaud's disease, aneurysms, arterial dissections, varicose veins, 
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thrombophlebitis and phlebothrombosis, vascular tumors, and complications of thrombolysis, balloon 
angioplasty, vascular replacement, and coronary artery bypass graft surgery, congestive heart failure, 
ischemic heart disease, angina pectoris, myocardial infarction, hypertensive heart disease, — - — • * 
degenerative valvular heart disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve, 
mitral annular calcification, mitral valve prolapse, rheumatic fever and rheumatic heart disease, 
infective endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus 
erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart 
disease, congenital heart disease, and complications of cardiac transplantation, congenital lung 
anomalies, atelectasis, pulmonary congestion and edema, pulmonary embolism, pulmonary 
hemorrhage, pulmonary infarction, pulmonary hypertension, vascular sclerosis, obstructive pulmonary 
disease, restrictive pulmonary disease, chronic obstructive pulmonary disease, emphysema, chronic 
bronchitis, bronchial asthma, bronchiectasis, bacterial pneumonia, viral and mycoplasmal pneumonia, 
lung abscess, pulmonary tuberculosis, diffuse interstitial diseases, pneumoconioses, sarcoidosis, 
idiopathic pulmonary fibrosis, desquamative interstitial pneumonitis, hypersensitivity pneumonitis, 
pulmonary eosinophilia bronchiolitis obliterans-organizing pneumonia, diffuse pulmonary 
hemorrhage syndromes, Goodpasture's syndromes, idiopathic pulmonary hemosiderosis, pulmonary 
involvement in collagen- vascular disorders, pulmonary alveolar proteinosis, lung tumors, 
inflammatory and noninflammatory pleura! effusions, pneumothorax, pieurai tumors, drug-induced 
lung disease, radiation-induced lung disease, and complications of lung transplantation; a 
reproductive disorder such as a disorder of prolactin production, infertility, including tubal disease, 
ovulatory defects, and endometriosis, a disruption of the estrous cycle, a disruption of the menstrual 
_ cycle,, polycystic ovary syndrome, ovarian hyperstimul at ion syndrome,, an endometrial or ovarian 
tumor, a uterine fibroid, autoimmune disorders, an ectopic pregnancy, and teratogenesis; cancer of the 
breast, fibrocystic breast disease, and galactorrhea; a disruption of spermatogenesis, abnormal sperm 
physiology, cancer of the testis, cancer of the prostate, benign prostatic hyperplasia, prostatitis, 
Peyronie's disease, impotence, carcinoma of the male breast, and gynecomastia; an immune disorder 
such as inflammation, actinic keratosis, acquired immunodeficiency syndrome (AIDS), Addison's 
disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, 
arteriosclerosis, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, 
autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, bursitis, 
cholecystitis, cirrhosis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, 
diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, 
erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout. Graves' 
disease, Hashimoto's thyroiditis, paroxysmal nocturnal hemoglobinuria, hepatitis, hypereosinophilia, 
irritable bowel syndrome, mixed connective tissue disease (MCTD), multiple sclerosis, myasthenia 
gravis, myocardial or pericardial inflammation, myelofibrosis, osteoarthritis, osteoporosis, 
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pancreatitis, polycythemia vera, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, 
scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic 
sclerosis, primary thrombocy the mia, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner 
syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, 
5 fungal, parasitic, protozoal, and helminthic infections, trauma, and hematopoietic cancer including 
lymphoma, leukemia, and myeloma; and a cell proliferative disorder such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 
(MeTD)rmyelofibrosisrparoxysmalnoc^ 

thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
10 myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, 
bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, 
lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, 
thymus, thyroid, and uterus. 

In another embodiment, a vector capable of expressing TPFT or a fragment or derivative 
15 thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of TPPT including, but not limited to, those described above. 

In a further embodiment, a pharmaceutical composition comprising a substantially purified 
TPPT in conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat 
or prevent a disorder associated with decreased expression or activity of TPPT including, but not 
20 limited to, those provided above. 

In still another embodiment, an agonist which modulates the activity of TPPT may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of TPPT including, but not limited to, those listed above. 

In a further embodiment, an antagonist of TPPT may be administered to a subject to treat or 
25 prevent a disorder associated with increased expression or activity of TPPT. Examples of such 
disorders include, but are not limited to, those transport, metabolic, neurological, cardiovascular, 
reproductive, and immune disorders, and cell proliferative disorders including cancer described 
above. In one aspect, an antibody which specifically binds TPPT may be used directly as an 
antagonist or indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to 
30 cells or tissues which express TPPT. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding TPPT may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of TPPT including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
35 sequences, or vectors of the invention may be administered in combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made 
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by one of ordinary skill in the art, according to conventional pharmaceutical principles. The 
combination of therapeutic agents may act synergistically to effect the treatment or prevention of the 
various disorders described above. Using this approach, one may be able to achieve therapeutic 
efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects. 
5 An antagonist of TPPT may be produced using methods which are generally known in the art. 

In particular, purified TPPT may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind TPPT. Antibodies to TPPT may also 
be generated using methods that are well known in the art. Such antibodies may include, but are not 
limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and 
10 fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit 
dimer formation) are generally preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be immunized by injection with TPPT or with any fragment or oligopeptide thereof 
which has immunogenic properties. Depending on the host species, various adjuvants may be used to 

15 increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral 
gels such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG 
(bacilli Calrnette-Guerin) and Corvnebacterium parvum are especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to TPPT 

20 have an amino acid sequence consisting of at least about 5 amino acids, and generally will consist of 
at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or fragments are 
identical to a portion of the amino acid sequence of the natural protein. Short stretches of TPPT 
amino acids may be fused with those of another protein, such as KLH, and antibodies to the chimeric 
molecule may be produced. 

25 Monoclonal antibodies to TPPT may be prepared using any technique which provides for the 

production of antibody molecules by continuous cell lines in culture. These include, but are not 
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma 
technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 
Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and 

30 Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. 
Natl. Acad. Sci. USA 81 :685 1-6855: Neuberger, M.S. et al. (1984) Nature 3 12:604-608; and Takeda, 

35 S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce TPPT-specific single 
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chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., 
Burton, D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening immunoglobulin libraries or panels of highly specific binding reagents as 
disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

-Antibody-fragments-which contain specific binding-sites-for-TPPT-may-also be-generated, 



10 



15 



For example, such fragments include, but are not limited to, F(ab% fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. 
et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between TPPT and its 
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering TPPT epitopes is generally used, but a competitive binding assay may 
20 also be employed (Pound, supra) . 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay 
techniques may be used to assess the affinity of antibodies for TPPT. Affinity is expressed as an 
association constant, PC,, which is defined as the molar concentration of TPPT-antibody complex 
divided by the molar concentrations of free antigen and free antibody under equilibrium conditions. 
The PL, determined for a preparation of polyclonal antibodies, which are heterogeneous in their 
affinities for multiple TPPT epitopes, represents the average affinity, or avidity, of the antibodies for 
TPPT. The K, determined for a preparation of monoclonal antibodies, which are monospecific for a 
particular TPPT epitope, represents a true measure of affinity. High-affinity antibody preparations 
with K, ranging from about 10' to 10 12 L/mole are preferred for use in immunoassays in which the 
TPPT-antibody complex must withstand rigorous manipulations. Low-affinity antibody preparations 
with K, ranging from about 10 6 to 10 7 L/mole are preferred for use in immunopurification and similar 
procedures which ultimately require dissociation of TPPT, preferably in active form, from the 
antibody (Catty, D. (1988) Antibodies. V olume I: A Practical Annroach . IRL Press, Washington DC; 
Liddeil, J.E. and A. Cryer (1991) A Practical C.iiirie to Monoclonal Antibodies . John Wiley & Sons, 
35 New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to 
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determine the quality and suitability of such preparations for certain downstream applications. For 
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 
5-10 mg specific antibody/ml, is generally employed in procedures requiring precipitationofTPPT^. 
antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines 
5 for antibody quality and usage in various applications, are generally available. (See, e.g., Catty, 
supra , and Coligan et al., supra .) 

In another embodiment of the invention, the polynucleotides encoding TPPT, or any fragment 
or complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
expression can be achieved by designing complementary sequences or antisense molecules (DNA, 

10 RNA, PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding 
TPPT. Such technology is well known in the art, and antisense oligonucleotides or larger fragments 
can be designed from various locations along the coding or control regions of sequences encoding 
TPPT. (See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics , Humana Press Inc., Totawa NJ.) 
In therapeutic use, any gene delivery system suitable for introduction of the antisense 

15 sequences into appropriate target cells can be used. Antisense sequences can be delivered 

intracellular^ in the form of an expression plasmid which- upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 
9(1 3): 1288- 1296.) Antisense sequences can also be introduced intracellularly through the use of viral 

20 vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood 
76:271; Ausubel, supra : Uckert, W. and W. Walther ( 1 994) Pharmacol. Ther. 63(3):323-347.) Other 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 
systems known in the art. (See, e.g., Rossi, J.J. (1995) Br. Med. Bull. 5 l(l):217-225; Boado, R.J. et 
al. (1998) J. Pharm. Sci. 87(1 1): 1308-13 1 5; and Morris, M.C. et ai. (1997) Nucleic Acids Res. 

25 25(14):2730-2736.) 

In another embodiment of the invention, polynucleotides encoding TPPT may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCID)-Xl disease characterized by X- 
linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 

30 immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thaiassamias, famiiiai 
hypercholesterolemia, and hemophilia resulting from Factor Vin or Factor EX deficiencies (Crystal, 

35 R.G. (1995) Science 270:404-410; Verma, I.M. and Somia, N. (1997) Nature 389:239-242)), (ii) 

express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
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cell proliferation), or (iii) express a protein which affords protection against intracellular parasites 
(e.g., against human retroviruses, such as human immunodeficiency vims (HIV) (Baltimore, D. (1988) 
Nature 335:395-396; Poeschla, E. et ah (1996) Proc. Natl. Acad. Sci. USA. 93:1 1395-1 1399), 
hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis ; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruziV In the 
case where a genetic deficiency in TPPT expression or regulation causes disease, the expression of 
TPPT from an appropriate population of transduced cells may alleviate the clinical manifestations 
caused by-the-genetic-defieiency^ — 



In a further embodiment of the invention, diseases or disorders caused by deficiencies in 
) TPPT are treated by constructing mammalian expression vectors encoding TPPT and introducing 
these vectors by mechanical means into TPPT-deficient cells. Mechanical transfer technologies for 
use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) 
ballistic gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene 
transfer, and (v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. 
Biochem. 62:191-217; Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and H. Recipon (1998) Curr. 
Opin. Biotechnol. 9:445-450). 

Expression vectors that may be effective for the expression of TPPT include, but are not 
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad CA), 
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, PTET-ON, 
PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). TPPT may be expressed using (i) a 
constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous sarcoma virus (RSV), SV40 
virus, thymidine kinase (TK), or p-actin genes), (ii) an inducible promoter (e.g., the 
tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci. USA 
89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and H.M. Blau (1998) 
Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid (Invitrogen)); the 
ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the 
FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi, F.M.V. 
and H.M. Blau, supra) ), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding TPPT from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. and A J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of 
these standardized mammalian transfection protocols. 
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In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to TPPT expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding TPPT under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
5 element (RRE) along with additional retrovirus c/s-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. 
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in 
an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 

10 receptors on the target cells or a promiscuous envelope protein such as VSVg (Arrnentano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. 
et al. (1998) J. ViroL 72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") 

15 discloses a method for obtaining retrovirus packaging cell lines and is hereby incorporated by 

reference. Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4 + T-cells), 
and the return of transduced cells to a patient are procedures well known to persons skilled in the art 
of gene therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020-7029; 
Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716; Ranga, 

20 U. et al. (1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 
.polynucleotides encoding TPPT to cells which have one or more genetic abnormalities with respect to 
the expression of TPPT. The construction and packaging of adenovirus-based vectors are well known 
to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to be 

25 versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
(Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 
described in U.S. Patent Number 5,707,618 to Arrnentano ("Adenovirus vectors for gene therapy"), 
hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P. A. et al. (1999) Annu. 
Rev. Nutr. 19:51 1-544; and Verma, LM. and N. Somia (1997) Nature 18:389:239-242, both 

30 incorporated by reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding TPPT to target cells which have one or more genetic abnormalities with 
respect to the expression of TPPT. The use of herpes simplex virus (HSV)-based vectors may be 
especially valuable for introducing TPPT to cells of the central nervous system, for which HSV has a 

35 tropism. The construction and packaging of herpes-based vectors are well known to those with 

ordinary skill in the an. A replication-competent herpes simplex virus (HSV) type 1 -based vector has 
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been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye 
Res. 169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 
Patent Number 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is 
hereby incorporated by reference. U.S. Patent Number 5,804,413 teaches the use of recombinant 
HSV d92 which consists of a genome containing at least one exogenous gene to be transferred to a 
cell under the control of the appropriate promoter for purposes including human gene therapy. Also 
taught by this patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 
-and-ieP22r-For HS V-vectorsrsee-also-Goins7W:F-et-ar(l 999rJ7Virol773:5 l9-532^d Xu7H7etal7 
(1994) Dev. Biol. 163:152-161, hereby incorporated by reference. The manipulation of cloned 
herpesvirus sequences, the generation of recombinant virus following the transfection of multiple 
plasmids containing different segments of the large herpesvirus genomes, the growth and propagation 
of herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of ' 
ordinary skill in the art. 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding TPPT to target cells. The biology of the prototypic alphavirus, 
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 
on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotech. 9:464-469). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than the full-length genomic RNA, 
resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity 
(e.g., protease and polymerase). Similarly, inserting the coding sequence for TPPT into the alphavirus 
genome in place of the capsid-coding region results in the production of a large number of TPPT- 
coding RNAs and the synthesis of high levels of TPPT in vector transduced cells. While alphavirus 
infection is typically associated with cell lysis within a few days, the ability to establish a persistent 
25 infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that 
the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy application 
(Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of alphaviruses will allow the 
introduction of TPPT into a variety of cell types. The specific transduction of a subset of cells in a 
population may require the sorting of cells prior to transduction. The methods of manipulating 
30 infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA transfections, and 
performing alphavirus infections, are well known to those with ordinary skill in the art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions 
-10 and +10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition 
can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it 
causes inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 
transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
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been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. and BJ. Can\ 
Molecular and Immunologic Approaches . Futura Publishing, Mt. Kisco NY, pp. 163-177.) A 
complementary sequence or antisense molecule may also be designed to block translationofcraRNA 
by preventing the transcript from binding to ribosomes. 
5 Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 

RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding TPPT. 
10 Specific ribozyme cleavage sites within any potential RNA target are initially identified by 

scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
15 candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared 
by any method known in the art for the synthesis of nucleic acid molecules. These include techniques 
for chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
20 Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA 

sequences encoding TPPT. Such DNA sequences may be incorporated into a wide variety of vectors 
with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs 
that synthesize complementary RNA, constitutively or inducibly, can be introduced into cell lines, 
cells, or tissues. 

25 RNA molecules may be modified to increase intracellular stability and half-life. Possible 

modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends 
of the molecule, or the use of phosphorothioate or 2 1 0-methyl rather than phosphodiesterase linkages 
within the backbone of the molecule. This concept is inherent in the production of PNAs and can be 
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, 

30 and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding TPPT. Compounds 
which may be effective in altering expression of a specific polynucleotide may include, but are not 

35 limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides, 
transcription factors and other polypeptide transcriptional regulators, and non-macromolecular 
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chemical entities which are capable of interacting with specific polynucleotide sequences. Effective 
compounds may alter polynucleotide expression by acting as either inhibitors or promoters of 
polynucleotide expression. Thus, in the treatment of disorders associated with increased TPPT 
expression or activity, a compound which specifically inhibits expression of the polynucleotide 
encoding TPPT may be therapeutically useful, and in the treament of disorders associated with 
decreased TPPT expression or activity, a compound which specifically promotes expression of the 
polynucleotide encoding TPPT may be therapeutically useful. 

At-Ieast one, and up to a-plurality^of-test compounds-may be screened-foreffectiveness in 

altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorially or randomly. A sample comprising a 
polynucleotide encoding TPPT is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted 
biochemical system. Alterations in the expression of a polynucleotide encoding TPPT are assayed by 
any method commonly known in the art. Typically, the expression of a specific nucleotide is detected 
by hybridization with a probe having a nucleotide sequence complementary to the sequence of the 
polynucleotide encoding TPPT. The amount of hybridization may be quantified, thus forming the 
basis for a comparison of the expression of the polynucleotide both with and without exposure to one 
or more test compounds. Detection of a change in the expression of a polynucleotide exposed to a 
test compound indicates that the test compound is effective in altering the expression of the 
polynucleotide. A screen for a compound effective in altering expression of a specific polynucleotide 
can be carried out, for example, using a Schizosaccharomvces pombe gene expression system 
(Atkins, D. et al. (1999) U.S. Patent No. 5.932,435; Arndt, G.M. et al. (2000) Nucleic Acids Res. 
28:E15) or a human cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. Res. 
Commun. 268:8-13). A particular embodiment of the present invention involves screening a 
combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide 
nucleic acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide 
sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, IW. et al. (2000) U.S. Patent 
No. 6,022,691). 

Many methods for introducing vectors into cells or tissues are available and equally suitable 
for use in vivo, in vitro, and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
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using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 
Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
5 monkeys. 

An additional embodiment of the invention relates to the administration of a pharmaceutical 
composition which generally comprises an active ingredient formulated with a pharmaceutical^ 
acceptable excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and 
proteins. Various formulations are commonly known and are thoroughly discussed in the latest 

10 edition of Remington's Pha rmaceutical Sciences (Maack Publishing, Easton PA). Such 

pharmaceutical compositions may consist of TPPT, antibodies to TPPT, and mimetics, agonists, 
antagonists, or inhibitors of TPPT. 

The pharmaceutical compositions utilized in this invention may be administered by any 
number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 

15 intramedullary, intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, 
intranasal, enteral, topical, sublingual, or rectal means. 

Pharmaceutical compositions for pulmonary administration may be prepared in liquid or dry 
powder form. These compositions are generally aerosolized immediately prior to inhalation by the 
patient. In the case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol 

20 delivery of fast-acting formulations is well-known in the art. In the case of macromolecules (e.g. 

larger peptides and proteins), recent developments in the field of pulmonary delivery via the alveolar 
region of the lung have enabled the practical delivery of drugs such as insulin to blood circulation 
(see, e.g., Patton, J.S. et al., U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of 
administration without needle injection, and obviates the need for potentially toxic penetration 

25 enhancers. 

Pharmaceutical compositions suitable for use in the invention include compositions wherein 
the active ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art. 

Specialized forms of pharmaceutical compositions may be prepared for direct intracellular 
30 del ivery of macromolecules comprising TPPT or fragments thereof. For example, liposome 

preparations containing a cell-impermeable macromolecule may promote cell fusion and intracellular 
delivery of the macromolecule. Alternatively, TPPT or a fragment thereof may be joined to a short 
cationic N-terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been 
found to transduce into the cells of all tissues, including the brain, in a mouse model system 
35 (Schwarze, S.R. et al. ( 1 999) Science 285: 1 569- 1 572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
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culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, 
or pigs. An animal model may also be used to determine the appropriate concentration range and 
route of administration. Such information can then be used to determine useful doses and routes for 
administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example TPPT 
or fragments thereof, antibodies of TPPT, and agonists, antagonists or inhibitors of TPPT, which 
ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by 
standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 
calculating the ED^ (the dose therapeutically effective in 50% of the population) or LD^ (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LDx/ED 50 ratio. Pharmaceutical compositions 
which exhibit large therapeutic indices are preferred. The data obtained from cell culture assays and 
animal studies are used to formulate a range of dosage for human use. The dosage contained in such 
compositions is preferably within a range of circulating concentrations that includes the ED*, with 
little or no toxicity. The dosage varies within this range depending upon the dosage form employed, 
the sensitivity of the patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the 
active moiety or to maintain the desired effect. Factors which may be taken into account include the 
severity of the disease state, the general health of the subject, the age, weight, and gender of the 
subject, time and frequency of administration, drug combination(s), reaction sensitivities, and 
response to therapy. Long-acting pharmaceutical compositions may be administered every 3 to 4 
days, every week, or biweekly depending on the half-life and clearance rate of the particular 
formulation. 

Normal dosage amounts may vary from about 0. 1 ptg to 100,000 ^g, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind TPPT may be used for the 
diagnosis of disorders characterized by expression of TPPT, or in assays to monitor patients being 
treated with TPPT or agonists, antagonists, or inhibitors of TPPT. Antibodies useful for diagnostic 
purposes may be prepared in the same manner as described above for therapeutics. Diagnostic assays 
for TPPT include methods which utilize the antibody and a label to detect TPPT in human body fluids 
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or in extracts of cells or tissues. The antibodies may be used with or without modification, and may 
be labeled by covalent or non-covalent attachment of a reporter molecule. A wide variety of reporter 

molecules, several of which are described above, are known in the an and may be used .... 

A variety of protocols for measuring TPPT, including ELIS As, RIAs, and FACS, are known 
5 in the art and provide a basis for diagnosing altered or abnormal levels of TPPT expression. Normal 
or standard values for TPPT expression are established by combining body fluids or cell extracts 
taken from normal mammalian subjects, for example, human subjects, with antibody to TPPT under 
conditions suitable for complex formation. The amount of standard complex formation may be 
quantitated by various methods, such as photometric means. Quantities of TPPT expressed in subject, 

!0 control, and disease samples from biopsied tissues are compared wkh the standard values. Deviation 
between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding TPPT may be used for 
diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and 

15 quantify gene expression in biopsied tissues in which expression of TPPT may be correlated with 
disease. The diagnostic assay may be used to determine absence, presence, and excess expression of 
TPPT, and to monitor regulation of TPPT levels during therapeutic intervention. 

In one aspect, hybridization with PGR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding TPPT or closely related molecules may be used to 

20 identify nucleic acid sequences which encode TPPT. The specificity of the probe, whether it is made 
from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a 
conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occurring sequences encoding TPPT, allelic variants, or related 
sequences. 

25 Probes may also be used for the detection of related sequences, and may have at least 50% 

sequence identity to any of the TPPT encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:44-86 or from 
genomic sequences including promoters, enhancers, and introns of the TPPT gene. 

Means for producing specific hybridization probes for DNAs encoding TPPT include the 

30 cloning of polynucleotide sequences encoding TPPT or TPPT derivatives into vectors for the 

production of mRNA probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, 

35 such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 
Polynucleotide sequences encoding TPPT may be used for the diagnosis of disorders 
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associated with expression of TPPT. Examples of such disorders include, but are not limited to, a 
transport disorder such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, 
Becker's muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes 
insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, 
5 normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, 
myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral 
neuropathy, cerebral neoplasms, prostate cancer; cardiac disorders associated with transport, e.g., 
anginarbradyarrythm^ Long q T syn drome, myocarditis, 

cardiomyopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial 
10 myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body myositis, 

infectious myositis, polymyositis; neurological disorders associated with transport, e.g., Alzheimer's 
disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, paranoid 
psychoses, and schizophrenia; and other disorders associated with transport, e.g., neurofibromatosis, 
postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia, Wilson's disease, 
15 cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness, hyperglycemia, 
hypoglycemia, Grave's disease, goiter, Cushing's disease, Addison's disease, glucose-galactose 
malabsorption syndrome, hypercholesterolemia, adrenoleukodystrophy, Zellweger syndrome, Menkes 
disease, occipital horn syndrome, von Gierke disease, cystinuria, iminoglycinuria, Hartup disease, and 
Fanconi disease; a metabolic disorder such as Addison's disease, cerebrotendinous xanthomatosis, 
20 congenital adrenal hyperplasia, coumarin resistance, cystic fibrosis, diabetes, fatty hepatocirrhosis, 
fructose- 1 ,6-diphosphatase deficiency, galactosemia, goiter, glucagonoma, glycogen storage diseases, 
hereditary fructose intolerance, hyperadrenalism, hypoadrenalism, hyperparathyroidism, 
hypoparathyroidism, hypercholesterolemia, hyperthyroidism, hypoglycemia, hypothyroidism, 
hyperlipidemia, hyperlipemia, lipid myopathies, lipodystrophies, lysosomal storage diseases, 
25 mannosidosis, neuraminidase deficiency, obesity, pentosuria phenylketonuria, and pseudovitamin D- 
deficiency rickets; a neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, 
cerebral neoplasms, Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's 
disease and other extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron 
disorders, progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple 
30 sclerosis and other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural 
empyema, epidural abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral 
central nervous system disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and 
Gerstmann-Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases 
of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
35 encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial 
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nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, 
peripheral nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, 
endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including 
mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), akathesia, amnesia, 
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, and familial 
frontotemporal dementia; a cardiovascular disorder such as arteriovenous fistula, atherosclerosis, 
hypertension, vasculitis, Raynaud's disease, aneurysms, arterial dissections, varicose veins, 
thrombophlebitis and phlebothrombosis, vascular tumors, and complications of thrombolysis, balloon 
angioplasty, vascular replacement, and coronary artery bypass graft surgery, congestive bean failure, 
ischemic heart disease, angina pectoris, myocardial infarction, hypertensive heart disease, 
degenerative valvular heart disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve, 
mitral annular calcification, mitral valve prolapse, rheumatic fever and rheumatic heart disease, 
infective endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus 
erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart 
disease, congenital heart disease, and complications of cardiac transplantation, congenital lung 
anomalies, atelectasis, pulmonary congestion and edema, pulmonary embolism, pulmonary 
hemorrhage, pulmonary infarction, pulmonary hypertension, vascular sclerosis, obstructive 
pulmonary disease, restrictive pulmonary disease, chronic obstructive pulmonary disease, 
emphysema, chronic bronchitis, bronchial asthma, bronchiectasis, bacterial pneumonia, viral and 
mycoplasmal pneumonia, lung abscess, pulmonary tuberculosis, diffuse interstitial diseases, 
pneumoconioses, sarcoidosis, idiopathic pulmonary fibrosis, desquamative interstitial pneumonitis, 
hypersensitivity pneumonitis, pulmonary eosinophilia bronchiolitis obliterans-organizing pneumonia, 
diffuse pulmonary hemorrhage syndromes, Goodpasture's syndromes, idiopathic pulmonary 
hemosiderosis, pulmonary involvement in collagen-vascular disorders, pulmonary alveolar 
proteinosis, lung tumors, inflammatory and noninflammatory pleural effusions, pneumothorax, 
pleural tumors, drug-induced lung disease, radiation-induced lung disease, and complications of lung 
transplantation; a reproductive disorder such as a disorder of prolactin production, infertility, 
including tubal disease, ovulatory defects, and endometriosis, a disruption of the estrous cycle, a 
disruption of the menstrual cycle, polycystic ovary syndrome, ovarian hyperstimulation syndrome, an 
endometrial or ovarian tumor, a uterine fibroid, autoimmune disorders, an ectopic pregnancy, and 
teratogenesis; cancer of the breast, fibrocystic breast disease, and galactorrhea; a disruption of 
spermatogenesis, abnormal sperm physiology, cancer of the testis, cancer of the prostate, benign 
prostatic hyperplasia, prostatitis, Peyronie's disease, impotence, carcinoma of the male breast, and 
gynecomastia; an immune disorder such as inflammation, actinic keratosis, acquired 
immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, 
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allergies, ankylosing spondylitis, amyloidosis, anemia, arteriosclerosis, asthma, atherosclerosis, 
autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune polyendocrinopathy- 
candidiasis-ectodermal dystrophy (APECED), bronchitis, bursitis, cholecystitis, cirrhosis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, 
episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's 
thyroiditis, paroxysmal nocturnal hemoglobinuria, hepatitis, hypereosinophilia, irritable bowel 

syndromermixed connective tissue disease (MeTD)rmultiple sclerosis r myasthenia gravis^ 

myocardial or pericardial inflammation, myelofibrosis, osteoarthritis, osteoporosis, pancreatitis, 
polycythemia vera, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 
Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, primary 
thrombocythernia, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, 
complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, 
parasitic, protozoal, and helminthic infections, trauma, and hematopoietic cancer including 
lymphoma, leukemia, and myeloma; and a cell proliferative disorder such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 
(MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary 
thrombocythernia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, 
bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, 
lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, 
thymus, thyroid, and uterus. The polynucleotide sequences encoding TPPT may be used in Southern 
or northern analysis, dot blot, or other membrane-based technologies; in PCR technologies; in 
dipstick, pin, and multiformat ELISA-Iike assays; and in microarrays utilizing fluids or tissues from 
patients to detect altered TPPT expression. Such qualitative or quantitative methods are well known 
in the art. 

In a particular aspect, the nucleotide sequences encoding TPPT may be useful in assays that 
detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding TPPT may be labeled by standard methods and added to a fluid or tissue sample 
from a patient under conditions suitable for the formation of hybridization complexes. After a 
suitable incubation period, the sample is washed and the signal is quantified and compared with a 
standard value. If the amount of signal in the patient sample is significantly altered in comparison to a 
control sample then the presence of altered levels of nucleotide sequences encoding TPPT in the 
sample indicates the presence of the associated disorder. Such assays may also be used to evaluate 
the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to 
monitor the treatment of an individual patient. 
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In order to provide a basis for the diagnosis of a disorder associated with expression of TPPT, 
a normal or standard profile for expression is established. This may be accomplished by combining 
body fluids or cell extracts taken from normal subjects, either animal or human; with a sequence, ora 
fragment thereof, encoding TPPT, under conditions suitable for hybridization or amplification. 
Standard hybridization may be quantified by comparing the values obtained from normal subjects 
with values from an experiment in which a known amount of a substantially purified polynucleotide is 
used. Standard values obtained in this manner may be compared with values obtained from samples 
from patients who are symptomatic for a disorder. Deviation from standard values is used to establish 
the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 
development of the disease, or may provide a means for detecting the disease prior to the appearance 
of actual ciinicai symptoms. A more definitive diagnosis of this type may allow health professionals 
to employ preventative measures or aggressive treatment earlier thereby preventing the development 
or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding TPPT 
may involve the use of PCR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced in vitro . Oligomers will preferably contain a fragment of a polynucleotide 
encoding TPPT, or a fragment of a polynucleotide complementary to the polynucleotide encoding 
TPPT, and will be employed under optimized conditions for identification of a specific gene or 
condition. Oligomers may also be employed under less stringent conditions for detection or 
quantification of closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 
encoding TPPT may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic disease 
in humans. Methods of SNP detection include, but are not limited to, single-stranded conformation 
polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers 
derived from the polynucleotide sequences encoding TPPT are used to amplify DNA using the 
polymerase chain reaction (PCR). The DNA may be derived, for example, from diseased or normal 
tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause differences in the 
secondary and tertiary structures of PCR products in single-stranded form, and these differences are 

58 

i ■> 



WO 00/78953 



PCT/US00/16668 



detectable using gel electrophoresis in non-denaturing gels. In fSCCP, the oligonucleotide primers 
are fluorescently labeled, which allows detection of the amplimers in high-throughput equipment such 
as DNA sequencing machines. Additionally, sequence database analysis methods, termed in silico 
SNP (isSNP), are capable of identifying polymorphisms by comparing the sequence of individual 
5 overlapping DNA fragments which assemble into a common consensus sequence. These computer- 
based methods filter out sequence variations due to laboratory preparation of DNA and sequencing 
errors using statistical models and automated analyses of DNA sequence chromatograms. In the 

altemative, SNPs may-be detected-and eharaeterized-by-mass-spectrometry-usingrfor examplerthe 

high throughput MASSARRAY system (Sequenom, Inc., San Diego CA). 
10 Methods which may also be used to quantify the expression of TPPT include radiolabeling or 

biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. 
et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 
15 interest is presented in various dilutions and a spectrophotometric or colorimetric response gives rapid 
quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as elements on a microarray. The microarray 
can be used in transcript imaging techniques which monitor the relative expression levels of large 

20 numbers of genes simultaneously as described in Seilhamer, J.J. et al., "Comparative Gene Transcript 
Analysis," U.S. Patent No. 5,840,484, incorporated herein by reference. The microarray may also be 
used to identify genetic variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to 
monitor progression/regression of disease as a function of gene expression, and to develop and 

25 monitor the activities of therapeutic agents in the treatment of disease. In particular, this information 
may be used to develop a pharmacogenomic profile of a patient in order to select the most appropriate 
and effective treatment regimen for that patient. For example, therapeutic agents which are highly 
effective and display the fewest side effects may be selected for a patient based on his/her 
pharmacogenomic profile. 

30 1x1 another embodiment, antibodies specific for TPPT, or TPPT or fragments thereof may be 

used as elements on a microarray. The microarray may be used to monitor or measure protein-protein 
interactions, drug-target interactions, and gene expression profiles, as described above. 

Microarrays may be prepared, used, and analyzed using methods kjiown in the art. (See, e.g., 
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 

35 USA 93:10614-10619; Baldeschweiier et al. (1995) PCT application W095/251 1 16; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. ScL USA 94:2150- 
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2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are well 
known and thoroughly described in DNA Microarrays: A Practical Approach . M. Schena, ed! (1999) 
Oxford University Press, London, hereby expressly incorporated by reference. 

In another embodiment of the invention, nucleic acid sequences encoding TPPT may be used 
5 to generate hybridization probes useful in mapping the naturally occurring genomic sequence. Either 
coding or noncoding sequences may be used, and in some instances, noncoding sequences may be 
preferable over coding sequences. For example, conservation of a coding sequence among members 
of a multi-gene family may potentially cause undesired cross hybridization during chromosomal 
mapping. The sequences may be mapped to a particular chromosome, to a specific region of a 
10 chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), 
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 
constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat. 
Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 
7: 149-154.) Once mapped, the nucleic acid sequences of the invention may be used to develop 
15 genetic linkage maps, for example, which correlate the inheritance of a disease state with the 

inheritance of a particular chromosome region or restriction fragment length polymorphism (RFLP). 
(See, e.g., Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic 
map data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra , pp. 965-968.) Examples of genetic 

20 map data can be found in various scientific journals or at the Online Mendelian Inheritance in Man 
(OMIM) World Wide Web site. Correlation between the location of the gene encoding TPPT on a 
physical map and a specific disorder, or a predisposition to a specific disorder, may help define the 
region of DNA associated with that disorder and thus may further positional cloning efforts. 

Jn situ hybridization of chromosomal preparations and physical mapping techniques, such as 

25 linkage analysis using established chromosomal markers, may be used for extending genetic maps. 

Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may 
reveaJ associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely 

30 localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, 
any sequences mapping to that area may represent associated or regulatory genes for further 
investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of 
the instant invention may also be used to detect differences in the chromosomal location due to 
translocation, inversion, etc., among normal, carrier, or affected individuals. 

35 In another embodiment of the invention, TPPT, its catalytic or immunogenic fragments, or 

oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
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screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes 
between TPPT and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test compounds are 
synthesized on a solid substrate. The test compounds are reacted with TPPT, or fragments thereof, 
and-washed.-Bound-TPPT-is then detected by methods well known in the art-Purified TPPT'cairalso 
be coated directly onto plates for use in the aforementioned drug screening techniques. Alternatively, 
non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug screening assays in which neutralizing 
antibodies capable of binding TPPT specifically compete with a test compound for binding TPPT. In 
this manner, antibodies can be used to detect the presence of any peptide which shares one or more 
antigenic determinants with TPPT. 

In additional embodiments, the nucleotide sequences which encode TPPT may be used in any 
molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following preferred specific 
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
of the disclosure in any way whatsoever. 

The disclosures of all patents, applications and publications, mentioned above and below, in 
particular U.S. Ser. No. 60/139,923, U.S. Ser. No. 60/148,177, U.S. Ser. No. 60/149,357, and U.S. 
Ser. No. 60/162,287, are hereby expressly incorporated by reference. 

EXAMPLES 

I. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some 
tissues were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized 
and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a 
monophasic solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged 
over CsCl cushions or extracted with chloroform. RNA was precipitated from the lysates with either 
isopropanol or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated 
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using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e.g., the POL Y(A)PURE. mRNA. . 
purification kit (Ambion, Austin TX). 
5 In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 

libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra , units 
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 

10 oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 
bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., 

15 PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), pcDNA2.1 plasmid 
(Invitrogen, Carlsbad CA), or pINCY plasmid (Incyte Genomics, Palo Alto CA). Recombinant 
plasmids were transformed into competent E. coli cells including XL 1 -Blue, XLl-BlueMRF, or 
SOLR from Stratagene or DH5a, DH10B, or ElectroMAX DH103 from Life Technologies. 

II. Isolation of cDNA Clones 

20 Plasmids obtained as described in Example I were recovered from host cells by in vivo 

excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using 
at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an 
AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, 
QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 96 

25 plasmid purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0. 1 
ml of distilled water and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 

30 384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically 
using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence 
scanner (Labsystems Oy, Helsinki, Finland). 

III. Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 
35 Sequencing reactions were processed using standard methods or high-throughput instrumentation 

such as the ABI CATALYST 800 (PE Biosystems) thermal cycler or the PTC-200 thermal cycler (MJ 
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Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the MICROLAB 
2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared using reagents 
provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as the ABI 
PRISM BIGDYE Terminator cycle sequencing ready reaction kit (PE Biosystems). Electrophoretic 
5 separation of cDNA sequencing reactions and detection of labeled polynucleotides were carried out 
using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the ABI PRISM 373 or 
377 sequencing system (PE Biosystems) in conjunction with standard ABI protocols and base calling 

software^or-other-sequenee-analysis-systems known in the art— Reading frames- within- the cDNA 

sequences were identified using standard methods (reviewed in Ausubel, 1997, supra , unit 7.7). Some 
10 of the cDNA sequences were selected for extension using the techniques disclosed in Example VI. 

The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed 
using a combination of software programs which utilize algorithms well known to those skilled in the 
art. Table 5 summarizes the tools, programs, and algorithms used and provides applicable 
descriptions, references, and threshold parameters. The first column of Table 5 shows the tools, 
15 programs, and algorithms used, the second column provides brief descriptions thereof, the third 
column presents appropriate references, all of which are incorporated by reference herein in their 
entirety, and the fourth column presents, where applicable, the scores, probability values, and other 
parameters used to evaluate the strength of a match between two sequences (the higher the score, the 
greater the homology between two sequences). Sequences were analyzed using MACDNASIS PRO 
20 software (Hitachi Software Engineering, South San Francisco CA) and LASERGENE software 

(DNASTAR). Polynucleotide and polypeptide sequence alignments were generated using the default 
parameters specified by the clustal algorithm as incorporated into the MEGALIGN multisequence 
alignment program (DNASTAR), which also calculates the percent identity between aligned 
sequences. 

25 The polynucleotide sequences were validated by removing vector, linker, and polyA 

sequences and by masking ambiguous bases, using algorithms and programs based on BLAST, ' 
dynamic programing, and dinucleotide nearest neighbor analysis. The sequences were then queried 
against a selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, 
and eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire 

30 annotation using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled 
into full length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and 
were screened for open reading frames using programs based on GeneMark, BLAST, and FASTA. 
The full length polynucleotide sequences were translated to derive the corresponding full length 
amino acid sequences, and these full length sequences were subsequently analyzed by querying 

35 against databases such as the GenBank databases (described above), SwissProt, BLOCKS, PRINTS, 
DOMO, PRODOM, Prosite, and Hidden Markov Model (HMM)-based protein family databases such 
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as PFAM. HMM is a probabilistic approach which analyzes consensus primary structures of gene 
families. (See, e.g., Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) 

The programs described above for the assembly and analysis of full length polynucleotide and 
amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
5 NO;44-86. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies were described in The Invention section above. 
IV. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 
10 from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; Ausubel, 
1995, supra , ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer 
15 search can be modified to determine whether any particular match is categorized as exact or similar. 
The basis of the search is the product score, which is defined as: 

BLAST Score x Percent Identity 

20 The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. The product score is a normalized value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 
product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is 
calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair 

25 (HSP), and -4 for every mismatch. Two sequences. may share more than one HSP (separated by gaps). 
If there is more than one HSP, then the pair with the highest BLAST score is used to calculate the 
product score. The product score represents a balance between fractional overlap and quality in a 
BLAST alignment. For example, a product score of 100 is produced only for 100% identity over the 
entire length of the shorter of the two sequences being compared. A product score of 70 is produced 

30 either by 100% identity and 70% overlap at one end, or by 88% identity and 100% overlap at the 

other. A product score of 50 is produced either by 100% identity and 50% overlap at one end, or 79% 
identity and 100% overlap. 

The results of northern analyses are reported as a percentage distribution of libraries in which 
the transcript encoding TPFT occurred. Analysis involved the categorization of cDNA libraries by 

35 organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
developmental, endocrine, gastrointestinal, hematopoietic/irnrnune, musculoskeletal, nervous, 
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reproductive, and uroiogic. The disease/condition categories included cancer, inflammation, trauma, 
cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the 
sequence of interest was counted and divided by the total number of libraries across all categories. 
Percentage values of tissue-specific and disease- or condition-specific expression are reported in 
5 Table 3. 

V. Chromosomal Mapping of TPPT Encoding Polynucleotides 

The cDNA sequences which were used to assemble SEQ ID NO:44-49 and SEQ ID NO:54-86 

were compared with sequences from theTncyterL:IFESEQ"database and^blic^orMi^atabases 

using BLAST and other implementations of the Smith- Waterman algorithm. Sequences from these 
10 databases that matched SEQ ID NO:44-49 and SEQ ID NO:54-86 were assembled into clusters of 
contiguous and overlapping sequences using assembly algorithms such as Phrap (Table 5). Radiation 
hybrid and genetic mapping data available from public resources such as the Stanford Human 
Genome Center (SHGC), Whitehead Institute for Genome Research (WIGR), and Genethon were 
used to determine if any of the clustered sequences had been previously mapped. Inclusion of a 
15 mapped sequence in a cluster resulted in the assignment of all sequences of that cluster, including its 
particular SEQ ID NO:, to that map location. 

The genetic map locations of SEQ ID NO:44, SEQ ID NO:48, SEQ ID NO:60, SEQ ID 
NO:65, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:73, SEQ ID NO:76, SEQ ID 
NO:80,and SEQ ID NO:83 are described in The Invention as ranges, or intervals, of human 

20 chromosomes. More than one map location is reported for SEQ ID NO:65, SEQ ID NO:73, SEQ ID 
NO:80, and SEQ ID NO:83, indicating that previously mapped sequences having similarity, but not 
complete identity, to SEQ ID NO:65, SEQ ID NO:73, SEQ ID NO:80, and SEQ ID NO:83 were 
assembled into their respective clusters. The map position of an interval, in centiMorgans, is 
measured relative to the terminus of the chromosome's p-arm. (The centiMorgan (cM) is a unit of 

25 measurement based on recombination frequencies between chromosomal markers. On average, 1 cM 
is roughly equivalent to 1 megabase (Mb) of DNA in humans, although this can vary widely due to 
hot and cold spots of recombination.) The cM distances are based on genetic markers mapped by 
Genethon which provide boundaries for radiation hybrid markers whose sequences were included in 
each of the clusters. Diseases associated with the public and Incyte sequences located within the 

30 indicated intervals are also reported in the Invention section where applicable. Human genome maps 
and other resources available to the public, such as the NCBI "GeneMap'99" World Wide Web site 
(http://www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously identified 
disease genes map within or in proximity to the intervals indicated above. 
VI. Extension of TPPT Encoding Polynucleotides 

35 The full length nucleic acid sequences of SEQ ID NO:44-86 were produced by extension of 

an appropriate fragment of the full length molecule using oligonucleotide primers designed from this 
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fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the other 
primer, to initiate 3' extension of the known fragment. The initial primers were designed using 
OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 _ 
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target 
sequence at temperatures of about 68 °C to about 72 °C. Any stretch of nucleotides which would 
result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. PCR 
was performed in 96-weIl plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2 *, (NH 4 ) 2 S0 4 , 
and p-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
(Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer 
pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68 °C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1 : 94 °C, 3 min; Step 2: 
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68 °C, 5 min; Step 7: storage at 4°C. 

The concentration of DNA in each well was determined by dispensing 100 \xl PICOGREEN 
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 pi of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
concentration of DNA. A 5 //l to 10 jA aliquot of the reaction mixture was analyzed by 
electrophoresis on a 1 % agarose mini -gel to determine which reactions were successful in extending 
the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones 
were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site 
overhangs, and transfected into competent E. coli cells. Transformed cells were selected on 
antibiotic-containing media, and individual colonies were picked and cultured overnight at 37 °C in 
384-well plates in LB/2x carb liquid media. 
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The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 
parameters: Step 1: 94°C, 3 min; Step 2: 94°C, J5 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; 
Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was 
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA 
recoveries were reamplified using the same conditions as described above. Samples were diluted with 
20% dimethysulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing 
-primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM 
BIGDYE Terminator cycle sequencing ready reaction kit (PE Biosystems). 

In like manner, the polynucleotide sequences of SEQ ID NO:44-86 are used to obtain 5' 
regulatory sequences using the procedure above, along with oligonucleotides designed for such 
extension, and an appropriate genomic library. 
VII. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:44-86 are employed to screen cDNAs, 
15 genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 /uCl of 
[y- 32 P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
20 (DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 

SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). An 
aliquot containing 10 7 counts per minute of the labeled probe is used in a typical membrane -based 
hybridization analysis of human genomic DNA digested with one of the following endonucleases: Ase 
I, Bgl H, Eco RI, Pst I, Xba I, or Pvu II (DuPont NEN). 
25 The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 

membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
30 compared. 

VIII. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschweiler, supra) , 
mechanical microspotting technologies, and derivatives thereof. The substrate in each of the 
35 aforementioned technologies should be uniform and solid with a non-porous surface (Schena ( 1999), 
supra). Suggested substrates include silicon, silica, glass slides, glass chips, and silicon wafers. 
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Alternatively, a procedure analogous to a dot or slot blot may also be used to arrange and link 
elements to the surface of a substrate using thermal, UV, chemical, or mechanical bonding 
procedures. A typical array may be produced using available methods-and machines well known to 
those of ordinary skill in the art and may contain any appropriate number of elements. (See, e.g., 
5 Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645; 
Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof 
may comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can 
be selected using software well known in the art such as LASERGENE software (DNASTAR). The 

10 array elements arc hybridized with poIyTiucieouues in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 

15 complementarity and the relative abundance of each polynucleotide which hybridizes to an element on 
the microarray may be assessed. In one embodiment, microarray preparation and usage is described 
in detail below. 

Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 

20 poly(A) + RNA is purified using the oligo-(dT) cellulose method. Each poly(A) + RNA sample is 

reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/ul oligo-(dT) primer (21mer), IX first 
strand buffer, 0.03 units/ul RNase inhibitor, 500 uM dATP, 500 uM dGTP, 500 uM dTTP, 40 uM 
dCTP, 40 uM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poly(A) + RNA with 

25 GEMBRIGHT kits (Incyte). Specific control poly(A) + RNAs are synthesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37 °C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 
incubated for 20 minutes at 85 °C to the stop the reaction and degrade the RNA. Samples are purified 
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 

30 (CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated 

using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and 
resuspended in 14 ul 5X SSC/0.2% SDS. 
Microarray Preparation 

35 Sequences of the present invention are used to generate array elements. Each array element 

is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification 
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uses primers complementary to the vector sequences flanking the cDNA insert. Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1-2 hg to a final quantity greater than 5 
ug. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia 
Biotech). 

5 Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 

slides (Corning) are cleaned by ultrasound in 0. 1 % SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
SSratific PrOT^ p A)t washed extensively in distilled water, and 

coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 1 10°C 
10 oven. 

, Array elements are applied to the coated glass substrate using a procedure described in US 
Patent No. 5,807,522, incorporated herein by reference. 1 ul of the array element DNA, at an average 
concentration of 100 ng/ul, is loaded into the open capillary printing element by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

15 Microarrays are UV-crossI inked using a STRATALINKER UV-cross linker (Stratagene). 

Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60 °C followed by washes in 0.2% 
SDS and distilled water as before. 

20 Hybridization 

Hybridization reactions contain 9 ul of sample mixture consisting of 0.2 ug each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 
mixture is heated to 65 °C for 5 minutes and is aliquoted onto the microarray surface and covered with 
an 1 .8 cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly 
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 
140 Ml of 5X SSC in a corner of the chamber. The chamber containing the arrays is incubated for 
about 6.5 hours at 60°C. The arrays are washed for 10 min at 45 °C in a first wash buffer (IX SSC, 
0.1% SDS), three times for 10 minutes each at 45 °C in a second wash buffer (0.1 X SSC), and dried. 
Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1 .8 cm x 1 .8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 
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In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
5 emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
0 the array contains a complementary DNA sequence, allowing the intensity of the signal at that 

location to be correlated with a weight ratio of hybridizing species of 1:100,000. When two samples 
from different sources (e.g., representing test and control cells), each labeled with a different 
fluorophore, are hybridized to a single array for the purpose of identifying genes that are differentially 
expressed, the calibration is done by labeling samples of the calibrating cDNA with the two 
5 fluorophores and adding identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping 
emission spectra) between the fluorophores using each fluorophore's emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each spot 
is centered in each element of the grid. The fluorescence signal within each element is then integrated 
to obtain a numerical value corresponding to the average intensity of the signal. The software used 
for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 

IX. Complementary Polynucleotides 

Sequences complementary to the TPPT-encoding sequences, or any parts thereof, are used to 
detect, decrease, or inhibit expression of naturally occurring TPPT. Although use of oligonucleotides 
comprising from about 15 to 30 base pairs is described, essentially the same procedure is used with 
smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO 
4.06 software (National Biosciences) and the coding sequence of TPPT. To inhibit transcription, a 
complementary oligonucleotide is designed from the most unique 5' sequence and used to prevent 
promoter binding to the coding sequence. To inhibit translation, a complementary oligonucleotide is 
designed to prevent ribosomal binding to the TPPT-encoding transcript. 

X. Expression of TPPT 
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Expression and purification of TPPT is achieved using bacterial or virus-based expression 
systems. For expression of TPPT in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid 
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express TPPT upon induction with isopropyl beta-D- 
thiogalactopyranoside (IPTG) 

or mammalian cell lines with recombinant Autoeraphica califomica nuclear polyhidrosis virus 
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding TPPT by either homologous recombination or bacterial -mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodoptera frugioerda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. 
et al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 
7:1937-1945.) 

In most expression systems, TPPT is synthesized as a fusion protein with, e.g., glutathione S- 
transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26- 
kilodalton enzyme from Schistoso ma iaponicum . enables the purification of fusion proteins on 
immobilized glutathione under conditions that maintain protein activity and antigenicity (Amersham 
Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved from 
TPPT at specifically engineered sites. FLAG, an 8-amino acid peptide, enables irnrnunoaffinity 
purification using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman 
Kodak). 6-His, a stretch of six consecutive histidine residues, enables purification on metal-chelate 
resins (QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, 
supra, ch. 10 and 16). Purified TPPT obtained by these methods can be used directly in the assays 
shown in Examples XI and XV. 
XI. Demonstration of TPPT Activity 

TPPT transport activity is assayed by measuring uptake of labeled substrates into Xenopus 
laeyis oocytes. Oocytes at stages V and VI are injected with TPPT mRNA (10 ng per oocyte) and 
incubated for 3 days at 18°C in OR2 medium (82.5mM NaCl, 2.5 mM KC1, ImM CaCI 2 , ImM MgCl 2 , 
1 mM Na 2 HP0 4 , 5 mM Hepes, 3.8 mM NaOH , 50ug/mJ gentamycin, pH 7.8) to allow expression of 
TPPT. Oocytes are then transferred to standard uptake medium ( lOOmM NaCl, 2 mM KCI, ImM 
CaCI 2 , ImM MgCU 10 mM Hepes/Tris pH 7.5). Uptake of various substrates (e.g., amino acids, 
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sugars, drugs, ions, and neurotransmitters) is initiated by adding labeled substrate (e.g. radiolabeled 
with 3 H, fluorescently labeled with rhodamine, etc.) to the oocytes. After incubating for 30 minutes, 
uptake is terminated by washing the oocytes three times in Na + -free medium, measuring the 
incorporated label, and comparing with controls. TPPT activity is proportional to the level of 
5 internalized labeled substrate. 
XII. Functional Assays 

TPPT function is assessed by expressing the sequences encoding TPPT at physiologically 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 
10 include pCMV SPORT plasmid (Life Technologies) and pCR3.1 plasmid (Invitrogen), both of which 
contain the cytomegalovirus promoter. 5-10 fig of recombinant vector are transiently transfected into 
a human cell line, for example, an endothelial or hematopoietic cell line, using either liposome 
formulations or electroporation. 1 -2 //g of an additional plasmid containing sequences encoding a 
marker protein are co-transfected. Expression of a marker protein provides a means to distinguish 

15 transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the 
recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP; 
Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser optics- 
based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate 
the apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of 

20 fluorescent molecules that diagnose events preceding or coincident with cell death. These events 
include changes in nuclear DNA content as measured by staining of DNA with propidium iodide; 
changes in cell size and granularity as measured by forward light scatter and 90 degree side light 
scatter; down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; 
alterations in expression of cell surface and intracellular proteins as measured by reactivity with 

25 specific antibodies; and alterations in plasma membrane composition as measured by the binding of 
fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are 
discussed in Ormerod, M.G. (1994) Flow Cytometry . Oxford, New York NY. 

The influence of TPPT on gene expression can be assessed using highly purified populations 
of cells transfected with sequences encoding TPPT and either CD64 or CD64-GFP. CD64 and CD64- 

30 GFP are expressed on the surface of transfected cells and bind to conserved regions of human 

immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success 
NY). mRNA can be purified from the cells using methods well known by those of skill in the art. 
Expression of mRNA encoding TPPT and other genes of interest can be analyzed by northern analysis 

35 or microarray techniques. 
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XIIL Production of TPPT Specific Antibodies 

TPPT substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488^95), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 
5 Alternatively, the TPPT amino acid sequence is analyzed using LASERGENE software 

(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selectionofappropriateepitopes, sucrTasthose near th^C^temiinus or iFH^r^fiilicregions are well 
described in the art. (See, e.g., Ausubel, 1995, supra , ch. 11.) 

10 Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431 A 

peptide synthesizer (PE Biosystems) using FMOC chemistry and coupled to KLH (Sigma-Aldrich, St. 
Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase 
immunogenicity. (See, e.g., Ausubel, 1995, supra .) Rabbits are immunized with the oligopeptide- 
KLH complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti- 

15 TPPT activity by, for example, binding the peptide or TPPT to a substrate, blocking with 1 % BSA, 
reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 

XIV. Purification of Naturally Occurring TPPT Using Specific Antibodies 
Naturally occurring or recombinant TPPT is substantially purified by immunoaffinity 

chromatography using antibodies specific for TPPT. An immunoaffinity column is constructed by 
20 covalently coupling anti-TPPT antibody to an activated chromatographic resin, such as 

CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 

blocked and washed according to the manufacturer's instructions. 

Media containing TPPT are passed over the immunoaffinity column, and the column is 

washed under conditions that allow the preferential absorbance of TPPT (e.g., high ionic strength 
25 buffers in the presence of detergent). The column is eluted under conditions that disrupt 

antibody/TPPT binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such as 

urea or thiocyanate ion), and TPPT is collected. 

XV. Identification of Molecules Which Interact with TPPT 

TPPT, or biologically active fragments thereof, are labeled with 125 I Bolton-Hunter reagent. 
30 (See, e.g., Bolton A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 

previously arrayed in the wells of a multi-well plate are incubated with the labeled TPPT, washed, and 
any wells with labeled TPPT complex are assayed. Data obtained using different concentrations of 
TPPT are used to calculate values for the number, affinity, and association of TPPT with the 
candidate molecules. 

35 Alternatively, molecules interacting with TPPT are analyzed using the yeast two-hybrid 

system as described in Fields. S. and O. Song (1989, Nature 340:245-246), or using commercially 
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available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

TPFT may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner. to determine alLinteractions. 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Patent 
5 No. 6,057,101). 

Various modifications and variations of the described methods and systems of the invention 
will be apparent to those skilled in the art without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with certain embodiments, it should be 
0 understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention which are obvious 
to those skilled in molecular biology or related fields are intended to be within the scope of the 
following claims. 
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What is claimed is: 

1 . An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

5 a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1 , SEQ ID 

NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, 
SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, 

SEQJD-NO:47,-SEQ4D-NO:J-8,^ 

SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, 
10 SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:3 1 , SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, 
SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:41, SEQ ID NO:42, 
and SEQ ID NO:43, 

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID 

15 NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, 
SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, 
SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, 
SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, 
SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, 

20 SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:41, SEQ ID NO:42, and SEQ ID 
NO:43, 

c ) a biologically active fragment of an amino acid sequence selected from the group 

consisting of SEQ ID NO: 1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID 
NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID 

25 NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID 
NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID 
NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID 
NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID 
NO:39, SEQ ID NO:41, SEQ ID NO:42, and SEQ ID NO:43, and 

30 d) an immunogenic fragment of an amino acid sequence selected from the group consisting 

of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ 
ID NO:8, SEQ ID NO:9, SEQ ID NO: 1 0, SEQ ID NO: 1 2, SEQ ID NO: 1 3, SEQ ID NO: 14, SEQ ID 
NO: 15, SEQ ID NO: 1 6, SEQ ID NO: 1 7, SEQ ID NO: 1 8, SEQ ID NO: 1 9, SEQ ID NO:20, SEQ ID 
NO:21, SEQ ID NO:22, SEQ ID NO:23 T SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID 

35 NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:3 1 , SEQ ID NO:32 ? SEQ ID 
NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36. SEQ ID NO:38, SEQ ID NO:39, SEQ ID 
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NO:41, SEQ ID NO:42, and SEQ ID NO:43. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO: 1, 
SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID 

5 NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID 
NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID 
NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID 
NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:3 1 , SEQ ID NO:32, SEQ ID NO:33, SEQ ID 
NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:41, SEQ ID 

10 NO:42, and SEQ ID NO:43. 



3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

15 

5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID 
NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:49, SEQ ID NO:50, SEQ ID 
NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57. SEQ ID 
NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID 

20 NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID 
NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID 
NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:8 1 , SEQ ID NO:82, SEQ ID 
NO:84, SEQ ID NO:85, SEQ ID NO:86. 

25 6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 

polynucleotide of claim 3. 



7. A cell transformed with a recombinant polynucleotide of claim 6. 



30 8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 



9. A method for producing a polypeptide of claim I, the method comprising: 
a) culmring a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 
35 comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim 
1, and 
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10. An isolated antibody which specifically binds to a polypeptide of claim 1. 

11. An isolated polynucleotide comprising a polynucleotide sequence selected from the 
J group consisting of: 

a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:44, SEQ ID 
NO:45 , SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:5 1 , SEQ ID 

NO:52, SEQ ID NO:53, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO^TTSEQ ID NOT587SEQ ID 

NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID 
NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID 
NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID 
NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:8I, SEQ ID NO:82, SEQ ID NO:84, SEQ ID 
NO:85, SEQ ID NO:86, 

b) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:44, SEQ ID NO:45, SEQ 
ID NO:46, SEQ ID NO:47, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID 
NO:53, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID 
NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID 
NO:66, SEQ ID NO:67 t SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO^O, SEQ ID NO:71, SEQ ID 
NO:72, SEQ ID NO:73, SEQ ID NO^ SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID 
NO:78, SEQ ID NO:79, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:84, SEQ ID NO:85, SEQ ID 
NO:86, 

c) a polynucleotide sequence complementary to a), 

d) a polynucleotide sequence complementary to b), and 

e) an RNA equivalent of a)-d). 

12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 1 1. 

13. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 
complex is formed between said probe and said target polynucleotide or fragments thereof, and 
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b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 



14. A method of claim 13, wherein the probe comprises at least 60 contiguous nucleotides. 

15. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 

16. A pharmaceutical composition comprising an effective amount of a polypeptide of claim 
1 and a pharmaceutically acceptable excipient. 

17. A pharmaceutical composition of claim 16, wherein the polypeptide comprises an amino 
acid sequence selected from the group consisting of SEQ ID NO: l , SEQ ID NO:2, SEQ ID NO:3, 
SEQ ID NO:4 ? SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:i0, SEQ ID 
NO: 12, SEQ ID NO: 1 3, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID 
NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID 
NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID 
NO:30, SEQ ID NO:3 1, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID 
NO:36, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:4I, SEQ ID NO:42, and SEQ ID NO:43. 

18. A method for treating a disease or condition associated with decreased expression of 
functional TPPT, comprising administering to a patient in need of such treatment the pharmaceutical 
composition of claim 16. 

19. A method for screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

20. A pharmaceutical composition comprising an agonist compound identified by a method 
of claim 19 and a pharmaceutically acceptable excipient. 
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21 . A method for treating a disease or condition associated with decreased expression of 
functional TPPT, comprising administering to a patient in need of such treatment a pharmaceutical 
composition of claim 20. 

5 22. A method for screening a compound for effectiveness as an antagonist of a polypeptide 

of claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim I to a compound, and 
b ) detecti ng antag onist activity in the sam ple. 

10 23* A pharmaceutical composition comprising an antagonist compound identified by a 

method of claim 22 and a pharmaceutically acceptable excipient. 

24. A method for treating a disease or condition associated with overexpression of functional 
TPPT, comprising administering to a patient in need of such treatment a pharmaceutical composition 
15 of claim 23. 



25. A method of screening for a compound that specifically binds to the polypeptide of claim 
1 , said method comprising the steps of: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
20 conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying a 
compound that specifically binds to the polypeptide of claim 1. 

26. A method of screening for a compound that modulates the activity of the polypeptide of 
25 claim 1, said method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under conditions 
permissive for the activity of the polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound, 

and 

30 c ) comparing the activity of the polypeptide of claim 1 in the presence of the test compound 

with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change 
in the activity of the polypeptide of claim 1 in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide of claim 1 . 

35 - 7 - A method for screening a compound for effectiveness in altering expression of a target 

polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 
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comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, and 

b) detecting altered expression of the target polynucleotide. 

5 28. An isolated polynucleotide comprising a polynucleotide sequence of SEQ ID NO:83. 

29. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 28. 

10 30. A cell transformed with a recombinant polynucleotide of claim 29. 

31. A transgenic organism comprising a recombinant polynucleotide of claim 29. 

32. A method for producing a polypeptide'comprising an amino acid sequence of SEQ ID 
15 NO:40, the method comprising: 

a) culturing the cell of claim 30 under conditions suitable for expression of the polypeptide, 

and 

b) recovering the polypeptide so expressed. 

20 33. A method for screening a compound for effectiveness in altering expression of a target 

polynucleotide, wherein said target polynucleotide comprises a sequence of claim 28, the method 
comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, and 

b) detecting altered expression of the target polynucleotide. 
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SEQUENCE LISTING 



<110> INCYTE GENOMICS, INC. 
LAL, Preeti 
YANG , Junraing 
YUE, Henry 
HILLMAN, Jennifer L. 
TANG, Y. Tom 
BANDMAN, Olga 
BURFORD , Neil 
BAUGHN , Mar i ah R. 

AZIMZAI, Yalda 

LU~ Dyung Aina~M" 

AU-YOUNG, Janice 
PATTERSON, Chandra 

<120> HUMAN TRANSPORT PROTEINS 

<130> PF-0709 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 60/139,923; 60/148,177; 60/149,357; 60/162,287 

<151> 1999-06-17; 1999-08-10; 1999-08-18; 1999-10-28 

<160> 86 

<170> PERL Program 

<210> 1 

<211> 623 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 264114CD1 

<400> 1 



Met 


Ser 


Thr 


Gin 


Asp 


Glu 


Arg 


Gin 


He 


Asn 


Thr 


Glu 


Tyr 


Ala 


Val 


1 








5 










10 








15 


Ser 


Leu 


Leu 


Glu 


Gin 


Leu 


Lys 


Leu 


Phe 


Tyr 


Glu 


Gin 


Gin 


Leu 


Phe 


Thr 








20 










25 










30 


Asp 


He 


Val 


Leu 


He 


Val 


Glu 


Gly 


Thr 


Glu 


Phe 


Pro 


Cys 


His 










35 










40 








45 


Lys 


Met 


Val 


Leu 


Ala 


Thr 


Cys 


Ser 


Ser 


Tyr 


Phe 


Arg 


Ala 


Met 


Phe 










50 










55 








60 


Met 


Ser 


Gly 


Leu 


Ser 
65 


Glu 


Ser 


Lys 


Gin 


Thr 
70 


His 


Val 


His 


Leu 


Arg 
75 


Asn 


Val 


Asp 


Ala 


Ala 


Thr 


Leu 


Gin 


He 


He 


He 


Thr 


Tyr 


Ala 


Tyr 


Thr 








80 










85 








90 


Gly 


Asn 


Leu 


Ala 
95 


Met 


Asn 


Asp 


Ser 


Thr 
100 


Val 


Glu 


Gin 


Leu 


Tyr 
105 


Glu 


Thr 


Ala 


Cys 


Phe 


Leu 


Gin 


Val 


Glu 


Asp 


Val 


Leu 


Gin 


Arg 


Cys 




Glu 






110 










115 








120 


Arg 


Tyr 


Leu 


He 


Lys 


Lys 


He 


Asn 


Ala 


Glu 


Asn 


Cys 


Val 


Arg 










125 










130 








135 


Leu 


Leu 


Ser 


Phe 


Ala 


Asp 


Leu 


Phe 


Ser 


Cys 


Glu 


Glu 


Leu 


Lys 


Gin 




Ala 






140 










145 








150 


Ser 


Lys 


Arg 


Met 


Val 


Glu 


His 


Lys 


Phe 


Thr 


Ala 


Val 


Tyr 


His 










155 










160 








165 


Gin 


Asp 


Ala 


Phe 


Met 
170 


Gin 


Leu 


Ser 


His 


Asp 
175 


Leu 


Leu 


He 


Asp 


He 
180 


Leu 


Ser 


Ser 


Asp 


Asn 
185 


Leu 


Asn 


Val 


Glu 


Lys 
190 


Glu 


Glu 


Thr 


Val 


Arg 
195 


Glu 


Ala 


Ala 


Met 


Leu 


Trp 


Leu 


Glu 


Tyr 


Asn 


Thr 


Glu 


Ser 


Arg 


Ser 
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200 

Gin Tyr Leu Ser Ser Val Leu Ser Gin 
215 

Ser Glu Val Thr Gin Arg Ala Trp Phe 
230 

Asp Lys Ser Val Val Val Gin Gly Leu 
245 

Phe Phe Lys Pro Arg Leu Gly Met Thr 
260 

Phe lie Glu Ala Ser Ser Glu Asn Pro 
275 

Val Cys Tyr Ser Pro Gin Ala Glu Lys 
290 

Pro Pro Ala Asp Leu His Lys Val Gly 
305 

Asn Asp lie Tyr lie Ala Gly Gly Gin 
320 

Lys Thr Asn II is Ser Lys Thr Ser Lys 
335 

Thr Val Asn Cys Phe Tyr Trp Phe Asp 
350 

Phe Pro Lys Thr Pro Met Leu Phe Val 
365 

Val Cys Cys Glu Gly Tyr lie Tyr Ala 
380 

Gly Gly Glu Leu Asn Arg Arg Thr Val 
395 

Lys Asp Glu Trp Thr Met Val Ser Pro 
410 

Trp Ser Ala Ala Val Val Val His Asp 
425 

Leu Asn Leu Met Tyr Cys Tyr Phe Pro 
440 

Glu Met Ala Met Arg Gin Thr Ser Arg 
455 

Ala Phe Gly Asp Lys lie Phe Tyr lie 
470 

Thr Asn Ser Gly lie Arg Leu Pro Ser 
485 

Ser Val Thr Val Glu lie Tyr Asp Val 
500 

Met Ala Ala Asn lie Pro Ala Lys Arg 
515 

Arg Ala Val Val lie Ser Asn Ser Leu 
530 

Thr His Leu Asn Glu Arg Ala Lys Tyr 
545 

Leu Glu Leu Asp Arg Trp Ser Leu Arg 
560 

Val Leu Trp Asp Leu Gly Arg Asp Phe 
575 

Leu Tyr Pro Ser Cys Leu Glu Glu Ser 
590 

Tyr Leu Phe Ser Thr Asp Gly Thr Glu 
605 

Glu Met Val Ala Leu Pro Pro Val 
620 

<210> 2 
<211> 99 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 1455669CD1 

<400> 2 

Met Ala Ala Pro Ala Glu Pro Cys Ala 
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205 










210 


lie 


Arcr 


lie 


Asp 


Ala 


Leu 


220 










225 


Gin 


Gly 


Leu 


Pro 


Pro 


Asn 


235 










240 


Tvr 


Lys 


Ser 


Met 


Pro 


Lys 


250 










255 


Lys 


Glu 


Glu 


Met 


Met 


He 


265 










270 


Cys 


Ser 
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He 
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Arg 
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Thr 


Val 
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Lys 
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Trp 
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Pro 


Thr 
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Glu 


Phe 
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Gly Gin Gly Val Trp Asn 
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15 10 15 



Gin 


Thr 


Glu 


Pro 
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20 
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Ala 


Ala 


Thr 
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Leu 
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Gly 


Arg 
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Gly 


Tyr 


Leu 


Arg 


Met 
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Pro 


Leu 


Phe 


Lys 


Ala 


Lys 


Met 


Val 


Ala 
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70 








75 


Ala 


He 
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Leu 
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Asn 


Val 
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Arg 
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Thr 
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Trp 
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Val 


Lys 


Ser 















95 

< 210> 3 

<211> 374 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2084989CD1 
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Leu 
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35 
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Leu 
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He 
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Gin 
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Gly 










50 
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Trp 
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Val 
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Thr 
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Trp 


Val 
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Ser 


Phe 
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He 


Val 
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Arg 
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Gly 
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Thr 
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Arg 


Gin 


Leu 
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Thr 
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Glu 


Leu 
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Val 


Gly 
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Pro 


Val 
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Gly 
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Ser 


Val 


Thr 
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Pro 
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Leu 


He 
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Lys 


Asp 


Phe 
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Lys 


Val 
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Val 
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270 


Glu 


Trp 


Ala 


Met 


Ala 


Gly 


Arg 


Ser 


Asp 


Gin 


Ala 


Pro 


Thr 


Leu 


Ser 
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335 340 345 

Thr Gin Lys Gly Thr Val He Val Met Leu Asp Leu Met Leu He 

350 355 , 360 

Leu Leu Pro Pro Ser Ala Ser He Leu Arg Gly Thr His Gly 

365 370 

<210> 4 
<211> 271 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2501034CD1 

<400> 4 
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Lys 


Gly 


Asn 


1 








5 










10 










15 


vax 


Asp 




V Ct-L 


Ala 


Ala 


Thr 


Pro 


Thr 


Ala 


Ala 


Ser 


Ala 


Ser 
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Trp 


Val 
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Phe 


He 


Leu 
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180 


Lys 


Ala 


Leu 


Leu 


Leu 


Ser 


Ser 


Tyr 


Gly 
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He 


Pro 
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Val 


He 


Trp 


Glu 


His 


Asp 


Tyr 


Thr 
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Val 
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Leu 


Lys 


Leu 
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205 










210 


He 


Lys 


Val 


Phe 


Val 


Leu 


Thr 


Ser 


Asn 


Phe 


Gin 


Ala 


He 


Arg 


Val 










215 










220 
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Thr 


Leu 


Asn 


He 


Asn 


Arg 


Lys 


Leu 


Ser 


Phe 


Leu 


Ala 


Val 


Leu 


Ser 










230 
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Gly 
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Glu 
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Met 


Val 
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260 265 270 

Phe 

<210> 5 
<211> 323 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2745212CD1 

<400> 5 

Met Ala Pro Lys Gin Asp Pro Lys Pro Lys Phe Gin Glu Gly Glu 

15 10 15 

Arg Val Leu Cys Phe His Gly Pro Leu Leu Tyr Glu Ala Lys Cys 

20 25 30 

Val Lys Val Ala He Lys Asp Lys Gin Val Lys Tyr Phe He His 
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35 40 45 

Tyr Ser Gly Trp Asn Lys Asn Trp Asp Glu Trp Val Pro Glu Ser 
50 55 60 

Arg Val Leu Lys Tyr Val Asp Thr Asn Leu Gin Lys Gin Arg Glu 
65 70 75 

Leu Gin Lys Ala Asn Gin Glu Gin Tyr Ala Glu Gly Lys Met Arg 
80 85 90 

Gly Ala Ala Pro Gly Lys Lys Thr Ser Gly Leu Gin Gin Lys Asn 
95 100 105 

Val Glu Val Lys Thr Lys Lys Asn Lys Gin Lys Thr Pro Gly Asn 
HO us 12 o 

Gly Asp Gly Gly Ser Thr Ser Glu Thr Pro Gin Pro Pro Arg Lys 
125 130 135 

L ys Arg Ala Arg Val Asp Pro T hr Val Glu Asn nin m n Thr_phe_ 

140 145 150 

Met Asn Arg Val Glu Val Lys Val Lys lie Pro Glu Glu Leu Lys 
155 160 165 

Pro Trp Leu Val Asp Asp Trp Asp Leu lie Thr Arg Gin Lys Gin 
170 175 180 

Leu Phe Tyr Leu Pro Ala Lys Lys Asn Val Asp Ser lie Leu Glu 
185 190 195 

Asp Tyr Ala Asn Tyr Lys Lys Ser Arg Gly Asn Thr Asp Asn Lys 
200 205 210 

Glu Tyr Ala Val Asn Glu Val Val Ala Gly lie Lys Glu Tyr Phe 
215 220 225 

Asn Val Met Leu Gly Thr Gin Leu Leu Tyr Lys Phe Glu Arg Pro 
230 235 240 

Gin Tyr Ala Glu lie Leu Ala Asp His Pro Asp Ala Pro Met Ser 
24 5 250 255 

Gin Val Tyr Gly Ala Pro His Leu Leu Arg Leu Phe Val Arg He 
260 265 270 

Gly Ala Met Leu Ala Tyr Thr Pro Leu Asp Glu Lys Ser Leu Ala 
275 280 285 

Leu Leu Leu Asn Tyr Leu His Asp Phe Leu Lys Tyr Leu Ala Lys 
290 295 300 

Asn Ser Ala Thr Leu Phe Ser Ala Ser Asp Tyr Glu Val Ala Pro 
305 310 315 

Pro Glu Tyr His Arg Lys Ala Val 
320 

<210> 6 

<211> 274 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 4833111CD1 
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145 
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Trp 
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Leu 


Trp 


Ser 
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Trp 


Arg 


Gin 


Asp 


His 
175 


Val 


Val 


Asp 


Ser 


Trp 
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Ser 


Ala 


Val 


Cys 
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Gly 


Ala 


Phe 


Ala 
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Gly 
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Ala 
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Val 
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Thr 
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Leu 
200 


Asp 
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205 


Arg 


He 


Thr 
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Ala 
210 


Lys 
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Ser 
215 


Thr 


Ala 


Asp 


Gly 


Asn 
220 


Val 


Leu 


Ser 


Val 


Leu 
225 


His 


Gly 


Val 


Trp 


Arg 
230 


Ser 


Gin 


Gly 


Leu 


Ala 
235 


Gly 


Leu 


Phe 


Ala 


Gly 
240 


Val 


Phe 


Pro 


Arg 


Met 
245 


Ala 


Ala 


He 


Ser 


Leu 
250 


Gly 


Gly 


Phe 


He 


Phe 
255 


Leu 


Gly 


Ala 


Tyr 


Asp 
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Arg 
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His 


Ser 


Leu 
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Glu 


Val 


Gly 
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Lys 
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<210> 7 
<211> 291 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 876677CD1 

<400> 7 
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Val 


Ser 
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Glu 


Asn 


1 








5 










10 










15 


Phe 


Val 
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Val 
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Lys 


Arg 
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Gly 


Val 
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Leu 
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Ser 
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Val 
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Ala 
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Arg 
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Ala 
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Ala 


Lys 


Gly 
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Val 
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Pro 
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He 


Asp 


Val 


Phe 


Val 
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Val 


Asp 
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Arg 


Thr 


Val 


Thr 


Cys 


Asn 
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Pro 


Pro 


Gin 


Glu 


He 
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105 


Leu 


Thr 


Arg 


Asp 


Ser 


Val 


Thr 


Thr 


Gin 


Val 


Asp 
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Val 


Val 


Tyr 
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115 










120 


Tyr 


Arg 


He 


Tyr 


Ser 


Ala 


Val 


Ser 


Ala 


Val 


Ala 


Asn 


Val 


Asn 
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130 
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Thr 


Phe 


Leu 


Leu 


Ala 


Gin 


Thr 
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Leu 


Arg 
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145 
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Val 


Leu 
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Thr 


Gin 


Thr 


Leu 


Ser 


Gin 
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Leu 


Ala 


Gly 


Arg 


Glu 
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165 
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He 


Ala 
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He 


Gin 


Thr 
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Asp 


Asp 


Ala 


Thr 


Glu 










170 










175 










180 


Leu 


Trp 


Gly 


He 


Arg 


Val 


Ala 


Arg 


Val 


Glu 


He 


Lys 


Asp 


Val 


Arg 
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Val 


Gin 


Leu 


Gin 


Arg 
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Met 


Ala 


Ala 


Glu 


Ala 


Glu 


Ala 
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Thr 
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Glu 
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Arg 


Ala 


Lys 


Val 


Leu 


Ala 


Ala 


Glu 


Gly 


Glu 


Met 
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Asn 


Ala 
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Ser 


Leu 


Lys 


Ser 


Ala 


Ser 


Met 
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Leu 


Ala 


Glu 
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Gin 


Leu 
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Gin 


Thr 


Leu 
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Thr 
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Val 


Ala 


Thr 


Glu 


Lys 


Asn 


Ser 


Thr 


He 


Val 


Phe 


Pro 


Leu 


Pro 


Met 
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270 


Asn 


He 


Leu 


Glu 


Gly 


He 


Gly 


Gly 


Val 


Ser 


Tyr 


Asp 


Asn 


His 


Lys 
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285 


Lys 


Leu 


Pro 


Asn 
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Ala 
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290 

<210> 8 

<211> 381 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2326i43CDl 



<400> 8 
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Arg 
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Val 
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Ala 
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Gly 
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Arg 
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uiy 


m 1 1 

ulU 
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Thr 
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Pro 
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Q C 
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Thr 


Glu 

O A 


Thr 


Ser 


Leu 


Gin 


Glu 
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-7 -J 
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Val 


Thr 


Ser 


Glu 
1 c\f\ 

1UU 
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Ala 


Glu 


Thr 


Ser 

i nc 
1UD 


Thr 


Glu 


Val 


lie 


Cys 

X ±\J 


Pro 


He 


Cys 


Ala 


Ala 


Leu 


Pro 


Gly 


Gly 


Asp 
i on 


Pro 


Asn 


His 


Val 


Thr 

X £* J 


Asp 


Asp 


Phe 


Ala 


Ala 


His 


Leu 


Thr 


Leu 


Glu 


His 


Arg 


Ala 


Pro 


Arg 

14 0 

X *± \J 


Asp 


Leu 


Asp 


Glu 


Ser 

X 44 D 


Ser 


Gly 


Val 


Arg 


His 

IjU 


Val 


Arg 


Arg 


Met 


Phe 

1 

J — J J 


His 


Pro 


Gly 


Arg 


Gly 

1DU 


Leu 


Gly 


Gly 


Pro 


Arg 

1 

± O D 


Ala 


Arg 


Arg 


Ser 


Asn 
170 


Met 


His 


Phe 


Thr 


Ser 

1 7R 

X / D 


Ser 


Ser 


Thr 


Gly 


Gly 
1 on 


Leu 


Ser 


Ser 


Ser 


Gin 
185 


Ser 


Ser 


Tyr 


Ser 


Pro 
i qo 


Ser 


Asn 


Arg 


Glu 


Ala 


Met 


Asp 


Pro 


He 


Ala 
200 


Glu 


Leu 


Leu 


Ser 


Gin 

U J 


Leu 


Ser 


Gly 


Val 


910 


Arg 


Ser 


Ala 


Gly 


Gly 


Gin 


Leu 


Asn 


Ser 


Ser 


Gly 


Pro 


Ser 


Ala 


Ser 










215 










220 








225 


Gin 


Leu 


Gin 


Gin 


Leu 
230 


Gin 


Met 


G In 


Leu 


\j Xll 

235 


Leu 


pin 


Arg 


\j XLk 


nib 
240 


Ala 


Gin 


Ala 


Ala 


Arg 
245 


Gin 


Gin 


Leu 


Glu 


Thr 
250 


Ala 


Arg 


Asn 


Ala 


Thr 
255 


Arg 


Arg 


Thr 


Asn 


Thr 
260 


Ser 


Ser 


Val 


Thr 


Thr 
265 


Thr 


He 


Thr 


Gin 


Ser 
270 


Thr 


Ala 


Thr 


Thr 


Asn 
275 


He 


Ala 


Asn 


Thr 


Glu 
280 


Ser 


Ser 


Gin 


Gin 


Thr 
285 


Leu 


Gin 


Asn 


Ser 


Gin 
290 


Phe 


Leu 


Leu 


Thr 


Arg 
295 


Leu 


Asn 


Asp 


Pro 


Lys 
300 


Met 


Ser 


Glu 


Thr 


Glu 
305 


Arg 


Gin 


Ser 


Met 


Glu 
310 


Ser 


Glu 


Arg 


Ala 


Asp 
315 


Arg 


Ser 


Leu 


Phe 


Val 
320 


Gin 


Glu 


Leu 


Leu 


Leu 
325 


Ser 


Thr 


Leu 


Val 


Arg 
330 


Glu 


Glu 


Ser 


Ser 


Ser 
335 


Ser 


Asp 


Glu 


Asp 


Asp 
340 


Arg 


Gly 


Glu 


Met 


Ala 
345 


Asp 


Phe 


Gly Ala 


Met 


Gly 


Cys 


Val 


Asp 


He 


Met 


Pro 


Leu 


Asp 


Val 










350 










355 










360 


Ala 


Leu 


Glu 


Asn 


Leu 
365 


Asn 


Leu 


Lys 


Glu 


Ser 
370 


Asn 


Lys 


Gly 


Asn 


Glu 
375 


Pro 


Pro 


Pro 


Pro 


Pro 


Leu 





















380 
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<211> 190 

<212> PRT 

<213> Homo sapiens 



7/60 



BNSOOCID: <WO 0078953A2 I > 



WO 00/78953 



PCT/US00/16668 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 2786302CD1 



<400> 9 




























Met 


Lys 


Tyr 


Gly 


Asn 


Glu 


He 


Met 


Asn 


Lys 


Asp 


Pro 


Val 


Phe 


Arg 


1 








5 










10 










15 


He 


Ser 


Pro 


Arg 


Ser 
20 


Arg 


Glu 


Thr 


His 


Pro 
25 


Asn 


Pro 


Glu 


Glu 


Pro 
30 


Glu 


Glu 


Glu 


Asp 


Glu 


Asp 


Val 


Gin 


Ala 


Glu 


Arg 


Val 


Gin 


Ala 


Ala 










35 










40 








45 


Asn 


Ala 


Leu 


Thr 


Ala 
50 


Pro 


Asn 


Leu 


Glu 


Glu 
55 


Glu 


Pro 


Val 


He 


Thr 
60 


Ala 


Ser 


Cys 


Leu 


His 
65 


Lys 


Glu 


Tyr 


Tyr 


Glu 
70 


Thr 


Lys 


Lys 


Ser 


Cys 
75 


Phe 


Ser 


Thr 


Arg 


Lys 
80 


Lys 


Lys 


He 


Ala 


He 
85 


Arg 


Asn 


Val 


Ser 


Phe 
90 


— _? — * 


Val 


T •« r«~ 


T t »*— 

J-l Jf !9 


T , , 

95 


Glu 


Val 


Leu 


Gly 


Leu 
100 


Leu 


Gly 


His 


Asn 


Gly 
105 


Ala 


Gly 


Lys 


Ser 


Thr 
110 


Ser 


He 


Lys 


Met 


He 
115 


Thr 


Gly 


Cys 


Thr 


Lys 
120 


Pro 


Thr 


Ala 


Gly 


Val 
125 


Val 


Val 


Leu 


Gin 


Gly 
130 


Ser 


Arg 


Ala 


Ser 


Val 
135 


Arg 


Gin 


Gin 


His 


Asp 
140 


Asn 


Ser 


Leu 


Lys 


Phe 
145 


Leu 


Gly 


Tyr 


Cys 


Pro 
150 


Gin 


Glu 


Asn 


Ser 


Leu 


Trp 


Pro 


Lys 


Leu 


Thr 


Met 


Lys 


Glu 


His 


Leu 










155 










160 








165 


Glu 


Leu 


Tyr 


Ala 


Ala 
170 


Val 


Glu 


Arg 


Leu 


Gly 
175 


Gin 


Lys 


Arg 


Cys 


Cys 
180 


Ser 


Gin 


Tyr 


Phe 


Thr 
185 


He 


Gly 


Gly 


Arg 


Ser 
190 
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<220> 

<221> misc_f eature 

<223> Incyte ID No: 3735780CD1 

<400> 10 



Met 


Met 


Asp 


Ser 


Glu 


Ala 


His 


Glu 


Lys 


Arg 


Pro 


Pro 


He 


Leu 


Thr 


1 








5 










10 










15 


Ser 


Ser 


Lys 


Gin 


Asp 


He 


Ser 


Pro 


His 


He 


Thr 


Asn 


Val 


Gly 


Glu 










20 










25 








30 


Met 


Lys 


His 


Tyr 


Leu 


Cys 


Gly 


Cys 


Cys 


Ala 


Ala 


Phe 


Asn 


Asn 


Val 










35 










40 










45 


Ala 


He 


Thr 


Phe 


Pro 


He 


Gin 


Lys 


Val 


Leu 


Phe 


Arg 


Gin 


Gin 


Leu 










50 










55 








60 


Tyr 


Gly 


He 


Lys 


Thr 


Arg 


Asp 


Ala 


He 


Leu 


Gin 


Leu 


Arg 


Arg 


Asp 










65 










70 










75 


Gly 


Phe 


Arg 


Asn 


Leu 


Tyr 


Arg 


Gly 


He 


Leu 


Pro 


Pro 


Leu 


Met 


Gin 










80 










85 










90 


Lys 


Thr 


Thr 


Thr 


Leu 


Ala 


Leu 


Met 


Phe 


Gly 


Leu 


Tyr 


Glu 


Asp 


Leu 










95 










100 










105 


Ser 


Cys 


Leu 


Leu 


His 


Lys 


His 


Val 


Ser 


Ala 


Pro 


Glu 


Phe 


Ala 


Thr 










110 










115 










120 


Ser 


Gly 


Val 


Ala 


Ala 


Val 


Leu 


Ala 


Gly 


Thr 


Thr 


Glu 


Ala 


He 


Phe 










125 










130 










135 


Thr 


Pro 


Leu 


Glu 


Arg 


Val 


Gin 


Thr 


Leu 


Leu 


Gin 


Asp 


His 


Lys 


His 


His 








140 










145 








150 


Asp 


Lys 


Phe 


Thr 


Asn 


Thr 


Tyr 


Gin 


Ala 


Phe 


Lys 


Ala 


Leu 


Lys 










155 










160 










165 


Cys 


His 


Gly 


He 


Gly 


Glu 


Tyr 


Tyr 


Arg 


Gly 


Leu 


Val 


Pro 


He 


Leu 


Phe 








170 










175 










180 


Arg 


Asn 


Gly 


Leu 


Ser 


Asn 


Val 


Leu 


Phe 


Phe 


Gly 


Leu 


Arg 


Gly 










185 










190 










195 


Pro 


He 


Lys 


Glu 


His 


Leu 


Pro 


Thr 


Ala 


Thr 


Thr 


His 


Ser 


Ala 


His 
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200 205 210 



Leu 


Val 


Asn 


Asp 


Phe 


lie 


Cys 


Gly 


Gly 


Leu 


Leu 


Gly 


Ala 


Met 


Leu 










215 










220 










225 


Gly 


Phe 


Leu 


Phe 


Phe 


Pro 


lie 


Asn 


Val 


Val 


Lys 


Thr 


Arg 


lie 


Gin 










230 










235 










240 


Ser 


Gin 


lie 


Gly 


Gly 


Glu 


Phe 


Gin 


Ser 


Phe 


Pro 


Lys 


Val 


Phe 


Gin 










245 










250 










255 


Lys 


lie 


Trp 


Leu 


Glu 


Arg 


Asp 


Arg 


Lys 


Leu 


lie 


Asn 


Leu 


Phe 


Arg 










260 










265 










270 


Gly 


Ala 


His 


Leu 


Asn 


Tyr 


His 


Arg 


Ser 


Leu 


lie 


Ser 


Trp 


Gly 


lie 










275 










280 










285 


lie 


Asn 


Ala 


Thr 


Tyr 


Glu 


Phe 


Leu 


Leu 


Lys 


Val 


lie 









290 295 
<210> 11 

<21T^89 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 039026CD1 

<400> 11 



Met 


Ala 


Ala 


Gin 


He 


Pro 


Glu 


Ser 


1 








5 








Glu 


Phe 


Leu 


Gly 


Thr 
20 


Tyr 


Asn 


Lys 


Asp 


Cys 


Val 


Lys 


Asp 
35 


Phe 


Thr 


Thr 


Thr 


Thr 


Cys 


Ser 


Glu 
50 


His 


Cys 


Leu 


Gin 


Arg 


lie 


Ser 


Met 
65 


Arg 


Phe 


Gin 


Glu 


Ala 


Leu 


Ala 


Ala 
80 


Lys 


Ala 


Gly 



<210> 12 

<211> 115 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 260607CD1 



Asp 


Gin 


He 


Lys 


Gin 


Phe 


Lys 




10 










15 


Leu 


Thr 


Glu 


Thr 


Cys 


Phe 


Leu 




25 










30 


Arg 


Glu 


Val 


Lys 


Pro 


Glu 


Glu 




40 










45 


Gin 


Lys 


Tyr 


Leu 


Lys 


Met 


Thr 




55 










60 


Glu 


Tyr 


His 


lie 


Gin 


Gin 


Asn 




70 










75 


Leu 


Leu 


Gly 


Gin 


Pro 


Arg 





85 



Met 


Ala 


Leu 


He 


Pro 


Ser 


Arg 


Val 


Trp 


Leu 


Pro 


Phe 


Ala 


Val 


Trp 


1 








5 










10 










15 


Val 


Val 


Asp 


Ser 


Ala 


Pro 


Val 


Arg 


Gly 


Leu 


Val 


Arg 


Arg 


Glu 


Pro 










20 










25 










30 


Phe 


Leu 


Arg 


Thr 


Gly 


Ser 


Phe 


He 


Ala 


Leu 


Phe 


Tyr 


Phe 


Pro 


Pro 










35 










40 










45 


Leu 


Leu 


Pro 


Val 


Leu 


He 


Asn 


Leu 


Phe 


Ser 


Phe 


Phe 


Leu 


Thr 


Pro 










50 










55 










60 


Ser 


Phe 


Trp 


Arg 


Gin 


Leu 


Gly Ala 


He 


Leu 


Val 


Tyr 


Ala 


Ser 


Leu 










65 










70 










75 


Leu 


Ala 


Glu 


Lys 


Thr 


Pro 


Phe 


Lys 


Thr 


Gin 


Arg 


Thr 


Leu 


Glu 


Gly 










80 










85 










90 


Asp 


Ala 


Leu 


Val 


Gly 


Ser 


Val 


Ser 


He 


Phe 


Leu 


Cys 


Ala 


Lys 


Asp 










95 










100 










105 


Arg 


Gin 


Thr 


Glu 


Ala 


Glu 


Arg 


Gly 


Cys 


Ser 













110 115 
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<223> Incyte ID No: 1429651CD1 
<400> 13 



Met 


Glu 


Ser 


Gly 


Thr 


Ser 


Ser 


Pro 


Gin 


Pro 


Pro 


Gin 


Leu 


Asp 


Pro 


1 








5 










10 










15 


Leu 


Asp 


Ala 


Phe 


Pro 
20 


Gin 


Lys 


Gly 


Leu 


Glu 
25 


Pro 


Gly 


Asp 


He 


Ala 
30 


Val 


Leu 


Val 


Leu 


Tyr 
35 


Phe 


Leu 


Phe 


Val 


Leu 
40 


Ala 


Val 


Gly 


Leu 


Trp 
45 


Ser 


Thr 


Val 


Lys 


Thr 
50 


Lys 


Arg 


Asp 


Thr 


Val 
55 


Lys 


Gly 


Tyr 


Phe 


Leu 
60 


Ala 


Gly 


Gly 


Asp 


Met 


Val 


Trp 


Trp 


Pro 


Val 


Gly Ala 


Ser 


Leu 


Phe 










65 










70 










75 


Ala 


Ser 


Asn 


Val 


Gly 
80 


Ser 


Gly 


His 


Phe 


He 
85 


Glv 


Leu 


Ala 


Gly 


Ser 
90 


Gly 


Ala 


Ala 


Thr 


Gly 
95 


He 


Ser 


Val 


Ser 


Ala 
100 


Tvr 


Glu 


Leu 


Asn 


Gly 
105 


Leu 


Phe 


Ser 


Val 


Leu 
110 


Met 


Leu 


Ala 


Trp 


He 
115 


Phe 


Leu 


Pro 


He 


Tyr 
120 


He 


Ala 


Gly 


Gin 


Val 
125 


Thr 


Thr 


Met 


Pro 


Glu 
130 


ryr 




Arg 


Lys 


Arg 
135 


Phe 


Gly 


Gly 


He 


Arg 
140 


He 


Pro 


He 


He 


Leu 
145 


Ala 


Val 


Leu 


Tyr 


Leu 
150 


Phe 


He 


Tyr 


He 


Phe 
155 


Thr 


Lys 


He 


Ser 


Val 
160 


Asp 


Met 


Tyr 


Ala 


Gly 
165 


Ala 


He 


Phe 


He 


Gin 
170 


Gin 


Ser 


Leu 


His 


Leu 
175 


Asp 




Tyr 


Leu 


Ala 
180 


He 


Val 


Gly 


Leu 


Leu 
185 


Ala 


He 


Thr 


Ala 


Val 
190 


i yx 


Thr 


Val 


Ala 


Gly 
195 


Gly 


Leu 


Ala 


Ala 


Val 
200 


He 


Tyr 


Thr 


Asp 


Ala 
205 


Leu 


Gin 


Thr 


Leu 


He 
210 


Met 


Leu 


He 


Gly 


Ala 
215 


Leu 


Thr 


Leu 


Met 


Gly 
220 


TH/r 


C £a t— 


Phe 


Ala 


Ala 
225 


Val 


Gly 


Gly 


Met 


Glu 
230 


Gly 


Leu 


Lys 


Glu 


Lys 
235 


i y i 


Phe 


Leu 


Ala 


240 


Ala 


Ser 


Asn 


Arg 


Ser 
245 


Glu 


Asn 


Ser 


Ser 


Cys 
250 


Gly 


Leu 


Pro 


Arg 


Glu 
255 


Asp 


Ala 


Phe 


His 


He 
260 


Phe 


Arg 


Asp 


Pro 


Leu 
265 


Thr 


Ser 


Asp 


Leu 


Pro 
270 


Trp 


Pro 


Gly 


Val 


Leu 
275 


Phe 


Gly 


Met 


Ser 


He 
280 


Pro 


Ser 


Leu 


Trp 


Tyr 
285 


Trp 


Cys 


Thr 


Asp 


Gin 
290 


Val 


He 


Val 


Gin 


Arg 
295 


Thr 


Leu 


Ala 


Ala 


Lys 
300 


Asn 


Leu 


Ser 


His 


Ala 
305 


Lys 


Gly 


Gly 


Ala 


Leu 
310 


Met 


Ala 


Ala 


Tyr 


Leu 
315 


Lys 


Val 


Leu 


Pro 


Leu 
320 


Phe 


He 


Met 


Val 


Phe 
325 


Pro 


Gly 


Met 


Val 


Ser 
330 


Arg 


He 


Leu 


Phe 


Pro 
335 


Asp 


Gin 


Val 


Ala 


Cys 
340 


Ala 


Asp 


Pro 


Glu 


He 
345 


Cys 


Gin 


Lys 


He 


Cys 
350 


Ser 


Asn 


Pro 


Ser 


Gly 
355 


Cys 


Ser 


Asp 


He 


Ala 
360 


Tyr 


Pro 


Lys 


Leu 


Val 
365 


Leu 


Glu 


Leu 


Leu 


Pro 
370 


Thr 


Gly 


Leu 


Arg 


Gly 
375 


Leu 


Met 


Met 


Ala 


Val 
380 


Met 


Val 


Ala 


Ala 


Leu 
385 


Met 


Ser 


Ser 


Leu 


Thr 
390 


Ser 


He 


Phe 


Asn 


Ser 
395 


Ala 


Ser 


Thr 


He 


Phe 
400 


Thr 


Met 


Asp 


Leu 


Trp 
405 


Asn 


His 


Leu 


Arg 


Pro 
410 


Arg 


Ala 


Ser 


Glu 


Lys 
415 


Glu 


Leu 


Met 


He 


Val 
420 


Gly 


Arg 


Val 


Phe 


Val 
425 


Leu 


Leu 


Leu 


Val 


Leu 
430 


Val 


Ser 


He 


Leu 


Trp 
435 


He 


Pro 


Val 


Val 


Glil 
440 


-» T _ 


Ser 


Gin 


Giy 


Gly 
445 


Gin 


Leu 


Phe 


He 


Tyr 
450 


He 


Gin 


Ser 


He 


Ser 
455 


Ser 


Tyr 


Leu 


Gln 


Pro 
460 


Pro 


Val 


Ala 


Val 


Val 
465 


Phe 


He 


Met 


Gly 


Cys 
470 


Phe 


Trp 


Lys 


Arg 


Thr 
475 


Asn 


Glu 


Lys 


Gly 


Ala 
480 


Phe 


Trp 


Gly 


Leu 


He 


Ser 


Gly 


Leu 


Leu 


Leu 


Gly 


Leu 


Val 


Arg 


Leu 
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Val 








4 AS 










4 y u 










495 


Leu 


A 

sp 




Tip 
lie 


J- y r 


va i 


o±n 


Pro 


Arg 


Cys 


Asp 


Gin 


Pro 


Asp 


Glu 








500 










DUD 










510 


Arg 


Pro 


Val 


Leu 


Val 


Lys 




lie 


His 


Tyr 


Leu 


Tyr 


Phe 


Ser 










ji j 










c T A 

DZU 








525 


Met 


lie 


Leu 


Ser 


Thr 


val 


i nr 


Leu 


lie 


Thr 


Val 


Ser 


Thr 


Val 


Ser 










530 










ETC 

DJD 










540 


Trn 


Phe 


Thr* 


Glu 


Pro 


Pro 




Lys 


CalU 


Met 


Val 


Ser 


His 


Leu 


Thr 










545 










^ ^ n 

j DU 










555 


Trp 


Phe 


Thr 


Ar*g 


His 


_ 

Asp 


Pro 


vai 


Val 


Gin 


Lys 


Glu 


Gin 


Ala 


Pro 










560 










r r c 

bob 








570 


Pro 


Ala 


Ala 


XT J. KJ 


Leu 


Ser 


Leu 


inr 


Leu 


Ser 


Gin 


Asn 


Gly 


Met 


Pro 


Glu 


Ala 






575 










c o r\ 








585 


Ser 




Ser 


Ser 


oer 


val 


Gin 


_Phe_ 


_Glu- 


.Met- 


-Val- 


-Gln- 


-G-lu- 




Thr 






590 










CQC 

3 y b 










600 


Asn 


Ser 


Lys 


Thr 


His 


Ser 




Asp 




i nr 


Pro 


Lys 


Gin 


Ser 




Val 






605 










610 








615 


Lys 


Val 


Lys 


Ala 


He 


Leu 


Trp 


Leu 


Cys 


Gly 


He 


Gin 


Glu 


Lys 


Gly 








620 










625 










630 


Lys 


Glu 


Glu 


Leu 


Pro 


Ala 


Arg 


Ala 


Glu 


Ala 


He 


He 


Val 


Ser 




Glu 






635 










640 










645 


Leu 


Glu 


Asn 


Pro 


Leu 


Val 


Lys 


Thr 


Leu 


Leu 


Asp 


Val 


Asn 


Leu 


He 


Phe 






650 










655 








660 


Cys 


Val 


Ser 


Cys 


Ala 


He 


Phe 


He 


Trp 


Gly 


Tyr 


Phe 


Ala 










665 










670 








675 
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<400> 14 



Met 


Tyr 


His 


Cys 


His 


Ser 


Gly 


1 








5 




Asn 


Glu 


Tyr 


Ala 


Tyr 
20 


Ala 


Lys 


He 


Cys 


Phe 


He 


Phe 
35 


Met 


He 


Ala 


Gly 


Ser 


Leu 


Ala 
50 


Val 


Val 


Asp 


Leu 


Thr 


Ser 


Phe 
65 


Leu 


Leu 


Ser 


Lys 


Pro 


Pro 


Ser 


Lys 


Arg 










80 




Glu 


He 


Leu 


Gly 


Ala 
95 


Leu 


Leu 


Thr 


Gly 


Val 


Leu 


Val 
110 


Tyr 


Leu 


Asp 


Tyr 


Gin 


He 


Gin 
125 


Ala 


Thr 


Ala 


Val 


Ala 


Ala 


Asn 
140 


He 


Val 


Cys 


Leu 


Gly 


His 


Asn 
155 


His 


Lys 


Arg 


Ala 


Ala 


Phe 


Val 
170 


His 


Ala 


Ser 


Val 


Leu 


He 


Ser 
185 


Ala 


Leu 


Lys 


He 


Ala 


Asp 
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Gly 


He 


Val 


Gly 


Val 


Phe 


He 








770 










775 








780 


He 


Phe 


Ala 


Val 


Phe 


Ala 


Val 


Leu 


Thr 


Val 


Ala 


He 


Leu 


Leu 


He 








785 










790 










795 


Met 


Glu 


Gly 


Leu 


Ser 


Ala 


Phe 


Leu 


His 


Ala 


Leu 


Arg 


Leu 


His 










800 










805 








810 


Trp 


Val 


Glu 


Phe 


Gin 


Asn 


Lys 


Phe 


Tyr 


Val 


Gly Asp 


Gly 


Tyr 


Lys 


Phe 








815 










820 










825 


Ser 


Pro 


Phe 


Ser 


Phe 


Lys 


His 


He 


Leu 


Asp 


Gly 


Thr 


Ala 


Glu 










830 










835 








840 


Glu 





























<210> 21 
<211> 253 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 3715961CD1 



<400> 21 



Met 


Ser 


Glu 


Cys 


Pro 


Leu 


He 


Leu 


Tyr 


He 


His 


Lys 


His 


He 


Asp 


1 








5 










10 










15 


Thr 


Tyr 


Ser 


Gin 


Ser 


Tyr 


Leu 


Phe 


Asn 


Asp 


Leu 


Phe 


Tyr 


Pro 


Val 










20 










25 








30 


Tyr 


Ser 


Gly 


Gly 


Arg 


Met 


Val 


Thr 


Tyr 


Glu 


His 


Leu 


Arg 


Glu 


Val 


Val 








35 










40 








45 


Phe 


Gly 


Lys 


Ser 


Glu 


Asp 


Glu 


His 


Tyr 


Pro 


Leu 


Trp 


Lys 


Ser 


Val 


He 






50 










55 






60 


Gly 


Gly 


Met 
65 


Met 


Ala 


Gly 


Val 


He 
70 


Gly 


Gin 


Phe 


Leu 


Ala 
75 


Asn 


Pro 


Thr 


Asp 


Leu 


Val 


Lys 


Val 


Gin 


Met 


Gin 


Met 


Glu 


Gly 


Lys 










80 










85 








90 


Arg 


Lys 


Leu 


Glu 


Gly 


Lys 


Pro 


Leu 


Arg 


Phe 


Arg 


Gly 


Val 


His 


His 


Ala 








95 










100 










105 


Phe 


Ala 


Lys 


He 


Leu 


Ala 


Glu 


Gly 


Gly 


He 


Arg 


Gly 


Leu 


Trp 


Ala 








110 










115 










120 


Gly 


Trp 


Val 


Pro 


Asn 


He 


Gin 


Arg 


Ala 


Ala 


Leu 


Val 


Asn 


Met 


Gly 








125 










130 










135 


Asp 


Leu 


Thr 


Thr 


Tyr 


Asp 


Thr 


Val 


Lys 


His 


Tyr 


Leu 


Val 


Leu 










140 










145 








150 


Asn 


Thr 


Pro 


Leu 


Glu 


Asp 


Asn 


He 


Met 


Thr 


His 


Gly 


Leu 


Ser 


Ser 










155 










160 








165 


Leu 


Cys 


Ser 


Gly 


Leu 
170 


Val 


Ala 


Ser 


He 


Leu 
175 


Gly 


Thr 


Pro 


Ala 


Asp 
180 
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Val lie Lys Ser Arg lie Met Asn Gin Pro Arg Asp Lys Gin Gly 

185 190 195 

Arg Gly Leu Leu Tyr Lys Ser Ser Thr Asp Cys Leu lie Gin Ala 

200 205 210 

Val Gin Gly Glu Gly Phe Met Ser Leu Tyr Lys Gly Phe Leu Pro 

215 220 225 

Ser Trp Leu Arg Met Thr Pro Trp Ser Met Val Phe Trp Leu Thr 

230 235 240 

Tyr Glu Lys lie Arg Glu Met Ser Gly Val Ser Pro Phe 

245 250 

<210> 22 
<211> 229 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 5108194CD1 

<400> 22 



Met 


Gly 


Asn 


Gly 


Val 


Lys 


Glu 


Gly 


1 








5 






Ala 


Glu 


Ala 


Val 


Leu 
20 


Ser 


Ser 


Ser 


Arg 


Gin 


Val 


Leu 


Ser 
35 


Ser 


Leu 


Leu 


Leu 


Ala 


Lys 


Thr 


Ala 
50 


Val 


Ala 


Pro 


Phe 


Gin 


Val 


Ser 


Ser 
65 


Lys 


Arg 


Phe 


Val 


Leu 


Tyr 


Tyr 


Thr 
80 


Tyr 


Leu 


Asn 


Arg 


Gly 


Asn 


Ser 


Ala 
95 


Thr 


Met 


Val 


He 


Gin 


Phe 


Ser 


Ala 
110 


His 


Glu 


Glu 


Tyr 


Tyr 


Gly 


Phe 


Arg 
125 


Gly 


Glu 


Ala 


Phe 


Ala 


Gly 


Ala 


Leu 
140 


Ala 


Gly 


Thr 


Pro 


Leu 


Asp 


Leu 


Val 
155 


Arg 


Ala 


Arg 


Met 


Tyr 


Ser 


Asn 


lie 
170 


Phe 


His 


Val 


Glu 


Gly 


Leu 


Lys 


Thr 
185 


Leu 


Tyr 


His 


Gly 


Val 


He 


Pro 


Tyr 
200 


Ala 


Gly 


Leu 


Leu 


Lys 


Ser 


Leu 


His 


Arg 


Glu 


Tyr 










215 







Phe Ser Glu Gly 



Pro 


Val 


Arg 


Leu 


His 


Glu 


Asp 




10 










15 


Val 


Ser 


Ser 


Lys 


Arg 


Asp 


His 




25 










30 


Ser 


Gly 


Ala 


Leu 


Ala 


Gly Ala 




40 










45 


Leu 


Asp 


Arg 


Thr 


Lys 


He 


He 




55 










60 


Ser 


Ala 


Lys 


Glu 


Ala 


Phe 


Arg 




70 










75 


Glu 


Gly 


Phe 


Leu 


Ser 


Leu 


Trp 




85 










90 


Arg 


Val 


Val 


Pro 


Tyr 


Ala 


Ala 




100 










105 


Tyr 


Lys 


Arg 


He 


Leu 


Gly 


Ser 




115 










120 


Leu 


Pro 


Pro 


Trp 


Pro 


Arg 


Leu 




130 










135 


Thr 


Ala 


Ala 


Ser 


Leu 


Thr 


Tyr 




145 










150 


Met 


Ala 


Val 


Thr 


Pro 


Lys 


Glu 




160 










165 


Phe 


He 


Arg 


He 


Ser 


Arg 


Glu 




175 










180 


Gly 


Phe 


Met 


Pro 


Thr 


Val 


Leu 




190 










195 


Ser 


Phe 


Phe 


Thr 


Tyr 


Glu 


Thr 




205 










210 


Ser 


Gly 


Arg 


Lys 


Leu 


He 


Pro 




220 










225 



<210> 23 
<211> 170 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 5503122CD1 



<400> 23 

Met Tyr Asp Asn Leu Tyr Leu 

1 5 
Gly Ser Ala Asp Ser Tyr Thr 
20 

Ser Leu Glu Glu Asp Arg Glu 
35 



His Gly He Glu Asp Ser Glu Ala 
10 15 

Ser Arg Pro Ser Asp Ser Asp Val 
25 30 

Ala He Arg Gin Glu Arg Glu Gin 
40 45 



20/60 



3NSDOCID- ^WO 0078953A2 ' -> 



WO 00/78953 



PCT/US00/16668 



Gin 


Ala 


Ala 


lie 


Gin 


Leu 


Glu 


Arg 


Ala 


Lys 


ser 


Lys 


Pro 


Val 


Ala 


Phe 








50 










55 










60 


Ala 


Val 


Lys 


Thr 
65 


Asn 


Val 


Ser 


Tyr 


Cys 
70 


Gly 


Ala 


Leu 


Asp 


Glu 
75 


Asp 


Val 


Pro 


Val 


Pro 


Ser 


Thr 


Ala 


lie 


Ser 


Phe 


Asp 


Ala 


Lys 


Asp 


Phe 








80 










85 










90 


Leu 


His 


lie 


Lys 
95 


Glu 


Lys 


Tvr 


Asn 


Asn 
100 


Asp 




Trn 


lie 


Glv 
105 


Arg 


Leu 


Val 


Lys 


Glu 
110 


Gly 


Cys 


Glu 


lie 


Gly 
115 


Phe 


lie 


Pro 


Ser 


Pro 
120 


Leu 


Arg 


Leu 


Glu 


Asn 
125 


lie 


Arg 


lie 


Gin 


Gin 
130 


Glu 


Gin 


Lys 


Arg 


Gly 
135 


Arg 


Phe 


His 


Gly 


Gly 
140 


Lys 


Ser 


Ser 


Gly 


Asn 
145 


Ser 


Ser 


Ser 


Ser 


Leu 
150 


Gly- 


-Glu- 


-Met- 


-Val_ 


_Ser Gly 
155 


Thr 


Phe 


Arg 


Ala 
loo- 


Thr 


Pro 


Thr 


Ser 


Thr 

1-65- 



Gly Glu Gly Cys Ser 
170 

<210> 24 
<211> 655 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 5517972CD1 

<400> 24 



Met 


Ser 


Ser 


O ^ -y- 


Asn 


Val 


\j J.U 


Val 


Phe 


I le 


Pro 


Val 


Ser 


Gin 


Gly 


1 








5 










10 










15 


Asn 


Thr 


Asn 


Glv 


Phe 


Pro 


Ala 


TTrr- 

-L I IX 


Air, 


o er 


Asn 


Asp 


Leu 


Lys 


a T -> 
A±a 










20 










25 








30 


Phe 


Thr 


Glu 


Gly 


Ala 


Val 


Leu 


Ser 


Phe 


His 


Asn 


He 


Cys 


Tyr 


Arg 










35 










. 40 








45 


Val 


Lys 


Leu 


Lys 


Ser 
50 


Gly 


Phe 


Leu 


Pro 


Cys 
55 


Arg 


Lys 


Pro 


Val 


Glu 
60 


Lys 


Glu 


He 


Leu 


Ser 
65 


Asn 


He 


Asn 


Gly 


He 
70 


Met 


Lys 


Pro 


Gly 


Leu 
75 


Asn 


Ala 


He 


Leu 


Gly 
80 


Pro 


Thr 


Gly 


Gly 


Gly 
85 


Lys 


Ser 


Ser 


Leu 


Leu 
90 


Asp 


Val 


Leu 


Ala 


Ala 
95 


Arg 


Lys 


Asp 


Pro 


Ser 
100 


Gly 


Leu 


Ser 


Gly 


Asp 
105 


Val 


Leu 


lie 


Asn 


Gly Ala 


Pro 


Arg 


Pro 


Ala 


Asn 


Phe 


Lys 


Cys 


Asn 










110 










115 








120 


Ser 


Gly 


Tyr 


Val 


Val 
125 


Gin 


Asp 


Asp 


Val 


Val 
130 


Met 


Gly 


Thr 


Leu 


Thr 
135 


Val 


Arg 


Glu 


Asn 


Leu 


Gin 


Phe 


Ser 


Ala 


Ala 


Leu 


Arg 


Leu 


Ala 


Thr 


Thr 








140 










145 








150 


Met 


Thr 


Asn 


His 


Glu 


Lys 


Asn 


Glu 


Arg 


He 


Asn 


Arg 


val 


He 


Gin 








155 










160 










165 


Glu 


Leu 


Gly 


Leu 


Asp 


Lys 


Val 


Ala 


Asp 


Ser 


Lys 


Val 


Gly 


Thr 










170 










175 








180 


Gin 


Phe 


He 


Arg 


Gly 


Val 


Ser 


Gly 


Gly 


Glu 


Arg 


Lys 


Arg 


Thr 


Ser 


He 








185 










190 










195 


Gly 


Met 


Glu 


Leu 
200 


He 


Thr 


Asp 


Pro 


Ser 
205 


He 


Leu 


Phe 


Leu 


Asp 
210 


Glu 


Pro 


Thr 


Thr 


Gly 
215 


Leu 


Asp 


Ser 


Ser 


Thr 
220 


Ala 


Asn 


Ala 


Val 


Leu 
225 


Leu 


Leu 


Leu 


Lys 


Arg 


Met 


Ser 


Lys 


Gin 


Gly 


Arg 


Thr 


He 


He 


Phe 




He 






230 










235 










240 


Ser 


His 


Gin 


Pro 


Arg 


Tyr 


Ser 


He 


Phe 


Lys 


Leu 


Phe 


Asp 


Ser 










245 










250 








255 


Leu 


Thr 


Leu 


Leu 


Ala 
260 


Ser 


Gly 


Arg 


Leu 


Met 
265 


Phe 


His 


Gly 


Pro 


Ala 
270 


Gin 


Glu 


Ala 


Leu 


Gly 
275 


Tyr 


Phe 


Glu 


Ser 


Ala 
280 


Gly 


Tyr 


His 


Cys 


Glu 
285 


Ala 


Tyr 


Asn 


Asn 


Pro 
290 


Ala 


Asp 


Phe 


Phe 


Leu 
295 


Asp 


He 


He 


Asn 


Gly 
300 
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Asp 


Ser 


Thr 


Ala 


Val 


Ala 


Leu 


Asn 


Arg 










305 








Thr 


Glu 


He 


He 


Glu 


Pro 


Ser 


Lys 


Gin 










320 








Lys 


Leu 


Ala 


Glu 


He 


Tyr 


Val 


Asn 


Ser 










335 










Lys 


Ala 


Glu 


Leu 


His 


Gin 


Leu 


Ser 


Gly 










350 








He 


Thr 


Val 


Phe 


Lys 


Glu 


lie 


Ser 


Tyr 










365 








Gin 


Leu 


Arg 


Trp 


Val 


Ser 


Lys 


Arg 


Ser 










380 










Asn 


Pro 


Gin 


Ala 


Ser 


He 


Ala 


Gin 


He 










395 










Gly 


Leu 


Val 


He 


Gly 


Ala 


He 


Tyr 


Phe 










410 










Thr 


Gly 


He 


Gin 


Asn 


Arg 


Ala 


Gly 


Val 










425 








Asn 


Gin 


Cys 


Phe 


Ser 


Ser 


Val 


Ser 


Ala 










440 










Glu 


Lys 


Lys 


Leu 


Phe 


He 


His 


Glu 


Tyr 










455 










Val 


Ser 


Ser 


Tyr 


Phe 


Leu 


Gly 


Lys 


Leu 










470 










Met 


Arg 


Met 


Leu 


Pro 


Ser 


lie 


He 


Phe 










485 










Met 


Leu 


Gly 


Leu 


Lys 


Pro 


Lys 


Ala 


Asp 










500 










Phe 


Thr 


Leu 


Met 


Met 


Val 


Ala 


Tyr 


Ser 










515 








Ala 


He 


Ala 


Ala 


Gly 


Gin 


Ser 


Val 


Val 










530 










Met 


Thr 


He 


Cys 


Phe 


Val 


Phe 


Met 


Met 










545 










Val 


Asn 


Leu 


Thr 


Thr 


He 


Ala 


Ser 


Trp 










560 








Phe 


Ser 


He 


Pro 


Arg 


Tyr 


Gly 


Phe 


Thr 










575 










Phe 


Leu 


Gly 


Gin 


Asn 


Phe 


Cys 


Pro 


Gly 










590 










Asn 


Pro 


Cys 


Asn 


Tyr 


Ala 


Thr 


Cys 


Thr 










605 










Lys 


Gin 


Gly 


He 


Asp 


Leu 


Ser 


Pro 


Trp 










620 








Val 


Ala 


Leu 


Ala 


Cys 


Met 


He 


Val 


He 










635 










Leu 


Lys 


Leu 


Leu 


Phe 


Leu 


Lys 


Lys 


Tyr 










650 










<210> 25 














<211> 184 














<212> PRT 














<213> Homo sapiens 










<220> 
















<221> misc_f eature 










<223> Incyte ID 


No: 


5593114CD1 




<400> 25 














Met 


Trp 


Val 


Phe 


Gly 


Tyr 


Gly 


Ser 


Leu 


1 








5 










Pro 


Tyr 


Gin 


Asp 


Lys 


Leu 


Val 


Gly 


Tyr 










20 










Arg 


Phe 


Trp 


Gin 


Gly 


Ser 


Thr 


Asp 


His 










35 










Pro 


Gly 


Arg 


Val 


Val 


Thr 


Leu 


Val 


Glu 










50 










Trp 


Gly 


Val 


Ala 


Tyr 


Arg 


Leu 


Pro 


Val 










65 











PCT/US00/16668 



Glu 


Glu 


Asp 


Phe 


Lys 


Ala 


310 










315 


Asp 


Lys 


Pro 


Leu 


He 


Glu 


325 










330 


Ser 


Phe 


Tyr 


Lys 


Glu 


Thr 


340 










345 


Gly 


Glu 


Lys 


Lys 


Lys 


Lys 


355 










360 


Thr 


Thr 


Ser 


Phe 


Cys 


His 


370 










375 


Phe 


Lys 


Asn 


Leu 


Leu 


Gly 


385 










390 


He 


Val 


Thr 


Val 


Val 


Leu 


400 










405 


Gly 


Leu 


Lys 


Asn 


Asp 


Ser 


415 










420 


Leu 


Phe 


Phe 


Leu 


Thr 


Thr 


430 










435 


Val 


Glu 


Leu 


Phe 


Val 


Val 


445 










450 


He 


Ser 


Gly 


Tyr 


Tyr 


Arcr 


460 










465 


Leu 


Ser 


Asp 


Leu 


Leu 


Pro 


475 










480 


Thr 


Cys 


He 


Val 


Tyr 


Phe 


490 










495 


Ala 


Phe 


Phe 


Val 


Met 


Met 


505 










510 


Ala 


Ser 


Ser 


Met 


Ala 


Leu 


520 










525 


Ser 


Val 


Ala 


Thr 


Leu 


Leu 


535 










540 


He 


Phe 


Ser 


Glv 


Leu 


Leu 


550 










555 


Leu 


Ser 


Tro 


Leu 


Gin 


Tvr 


565 










570 


Ala 


Leu 


Gin 


His 


Asn 


Glu 


580 










585 


Leu 


Asn 


Ala 


Thr 


Glv 


Asn 


595 










600 


Gly 


Glu 


Glu 


Tyr 


Leu 


Val 


610 










615 


Gly 


Leu 


Trp 


Lys 


Asn 


His 


625 










630 


Phe 


Leu 


Thr 


He 


Ala 


Tyr 


640 










645 


Ser 












655 













He 


Trp 


Lys 


Val 


Asp 


Phe 


10 










15 


He 


Thr 


Asn 


Tyr 


Ser 


Arg 


25 










3 u 


Arg 


Gly 


Val 


Pro 


Gly 


Lys 


40 










45 


Asp 


Pro 


Ala 


Gly 


Cys 


Val 


55 










60 


Gly 


Lys 


Glu 


Glu 


Glu 


Val 


70 










75 
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Lys Ala Tyr 


Leu 


Asp 


Phe 


Arg 


Glu 


Lys 


Gly 


Gly 


Tyr 


Arg 


Thr 


Thr 






80 










85 








90 


Thr Val lie 


Phe 


Tyr 


Pro 


Lys 


Asp 


Pro 


Thr 


Thr 


Lys 


Pro 


Phe 


Ser 






95 










100 








105 


Val Leu Leu 


Tyr 


He 


Gly 


Thr 


Cys 


Asp 


Asn 


Pro 


Asp 


Tyr 


Leu 


Gly 






110 










115 










120 


Pro Ala Pro 


Leu 


Glu 


Asp 


He 


Ala 


Glu 


Gin 


He 


Phe 


Asn 


Ala 


Ala 






125 










130 










135 


Gly Pro Ser 


Gly 


Arg 


Asn 


Thr 


Glu 


Tyr 


Leu 


Phe 


Glu 


Leu 


Ala 


Asn 






140 










145 










150 


Ser lie Arg 


Asn 


Leu 


Val 


Pro 


Glu 


Glu 


Ala 


Asp 


Glu 


His 


Leu 


Phe 






155 










160 








165 


Ala Leu Glu 


Lys 


Leu 


Val 


Lys 


Glu 


Arg 


Leu 


Glu 


Gly 


Lys 


Gin 


Asn 






170 










175 










180 


— Leu-Asn-Gys- 
<210> 26 


-Ile- 
























<211> 154 


























<212> PRT 


























<213> Homo sapiens 






















<220> 


























<221> misc_feature 






















<223> Incyte ID 


No; 


044775CD1 
















<400> 26 


























Met Gly Ala 


Phe 


Glu 


Cys 


Val 


Arg 


Lys 


Val 


Tyr 


Gin 


Thr 


Asp 


Gly 


1 




5 










10 










15 


Leu Lys Gly 


Phe 


Tyr 


Arg 


Gly 


Met 


Ser 


Ala 


Ser 


Tyr 


Ala 


Gly 


He 






20 










25 






30 


Ser Glu Thr 


Val 


He 


His 


Phe 


Val 


He 


Tyr 


Glu 


Ser 


He 


Lys 


Gin 






35 










40 










45 


Lys Leu Leu 


Glu 


Tyr 


Lys 


Thr 


Ala 


Ser 


Thr 


Met 


Glu 


Asn 


Asp 


Glu 






50 










55 








60 


Glu Ser Val 


Lys 


Glu 


Ala 


Ser 


Asp 


Phe 


Val 


Gly 


Met 


Met 


Leu 


Ala 






65 










70 








75 


Ala Ala Thr 


Ser 


Lys 


Thr 


Cys 


Ala 


Thr 


Thr 


He 


Ala 


Tyr 


Pro 


His 






80 










85 








90 


Glu Val Val 


Arg 


Thr 


Arg 


Leu 


Arg 


Glu 


Glu 


Gly 


Thr 


Lys 


Tyr 


Arg 






95 










100 










105 


Ser Phe Phe 


Gin 


Thr 


Leu 


Ser 


Leu 


Leu 


Val 


Gin 


Glu 


Glu 


Glv 


Tvr 






110 










115 








120 


Gly Ser Leu 


Tyr 


Arg 


Gly 


Leu 


Thr 


Thr 


His 


Leu 


Val 


Arg 


Gin 


lie 






125 










130 










135 


Pro Asn Thr 


Ala 


He 


Met 


Met 


Ala 


Thr 


Tvr 


Glu 


Leu 


Val 


Val 


Tvr 






140 










145 










150 


Leu Leu Asn 


Gly 
























<210> 27 


























<211> 438 


























<212> PRT 


























<213> Homo sapiens 






















<220> 


























<221> misc_f eature 






















<223> Incyte ID 


No: 


116588CD1 
















<400> 27 


























Met Leu Leu 


Val 


Thr 


Pro 


Arg 


Pro 


Glu 


Arg 


Gly 


Gly 


Arg 


Gly 


Thr 


1 




5 










10 










15 


Glu Leu Gly 


Glu 


Phe 


Cys 


Gly 


Thr 


Pro 


Leu 


Leu 


Phe 


Ser 


Ser 


Tyr 






20 










25 










30 


Phe Cys Tyr 


Asp 


Asn 


Pro 


Ala 


Ala 


Leu 


Gin 


Thr 


Gin 


Val 


Lys 


Arg 
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Arg Trp Glu Leu Asn Ala Met Thr Thr Asn Ser Asn He Ser Arg 

365 370 375 

Pro He Val Ser Ser His Gly 
380 

<210> 36 

<211> 287 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2676443CD1 

<400> 36 

— Met_Al a Ala Glu Ala Arg Val Ser Ar g Tr p Tyr Phe Gly G ly Leu 

1 5 10 rs : 

Ala Ser Cys Gly Ala Ala Cys Cys Thr His Pro Leu Asp Leu Leu 

20 25 30 
Lys Val His Leu Gin Thr Gin Gin Glu Val Lys Leu Arg Met Thr 

35 40 45 
Gly Met Ala Leu Arg Val Val Arg Thr Asp Gly lie Leu Ala Leu 

50 55 60 

Tyr Ser Gly Leu Ser Ala Ser Leu Cys Arg Gin Met Thr Tyr Ser 

65 70 75 
Leu Thr Arg Phe Ala lie Tyr Glu Thr Val Arg Asp Arg Val Ala 

80 85 90 

Lys Gly Ser Gin Gly Pro Leu Pro Phe His Glu Lys Val Leu Leu 

95 100 105 
Gly Ser Val Ser Gly Leu Ala Gly Gly Phe Val Gly Thr Pro Ala 

110 115 120 

Asp Leu Val Asn Val Arg Met Gin Asn Asp Val Lys Leu Pro Gin 

125 130 135 

Gly Gin Arg Arg Asn Tyr Ala His Ala Leu Asp Gly Leu Tyr Arg 

140 145 150 

Val Ala Arg Glu Glu Gly Leu Arg Arg Leu Phe Ser Gly Ala Thr 

155 160 * 165 

Met Ala Ser Ser Arg Gly Ala Leu Val Thr Val Gly Gin Leu Ser 

170 175 180 

Cys Tyr Asp Gin Ala Lys Gin Leu Val Leu Ser Thr Gly Tyr Leu 

185 190 195 

Ser Asp Asn lie Phe Thr His Phe Val Ala Ser Phe lie Ala Gly 

200 205 210 

Gly Cys Ala Thr Phe Leu Cys Gin Pro Leu Asp Val Leu Lys Thr 

215 220 225 

Arg Leu Met Asn Ser Lys Gly Glu Tyr Gin Gly Val Phe His Cys 

230 235 240 

Ala Val Glu Thr Ala Lys Leu Gly Pro Leu Ala Phe Tyr Lys Gly 

245 250 255 

Leu Val Pro Ala Gly He Arg Leu He Pro His Thr Val Leu Thr 

260 265 270 

Phe Val Phe Leu Glu Gin Leu Arg Lys Asn Phe Gly He Lys Val 

275 280 285 

Pro Ser 
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<211> 497 
<212> PRT 
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<220> 
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<223> Incyte ID No: 3295764CD1 

<400> 37 

Met Asp Val Pro Gly Pro Val Ser Arg Arg Ala Ala Ala Ala Ala 

15 10 15 

Ala Thr Val Leu Leu Arg Thr Ala Arg Val Arg Arg Glu Cys Tro 

20 25 30 
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460 
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Tyr 


Phe 
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465 


He 
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Val 


Val 


Phe 
470 


Leu 
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Ser 


Gly 
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475 


Val 


Ser 


Val 


Met 


Lys 
480 


Lys 


Cys 


Arg 


Lys 


Leu 
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Asp 


Pro 


Gin 


Ser 
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Ser 
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Gin 


Val 


Thr 
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<223> Incyte ID 


No: 


3438320CD1 
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Met 


Pro 


Arg 


Arg 
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Leu 


Val 
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Gly 


Pro 
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Leu 


GlU 


• Tyr 1 Phe 


1 
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10 










15 


Gin 


Arg 


Arg 


Tyr 


Phe 


1 nr 


Pro 


Ala 


Glu 


Val 


Ala 


Gin 


His 


Asn 












20 










25 
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Pro 


Glu 


Asp 


Leu 


Trp 


Val 


Ser 


Tyr 


Leu 


Gly 


Arg 


Val 


Tyr 


Asp 


Leu 
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40 










Thr 


Ser 


Leu 


ax a 


Gin 


Glu 


Tyr 


Lys 
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Leu 


Leu 


Leu 


Lys 


Pro 










50 










55 










o u 


lie 


Val 


Glu 


Val 


Ala 


Gly 


Gin 


Asp 


He 


Ser 


His 


Trp 


Phe 


Asp 


Pro 










65- 










70- 






*7 


Lys 


Thr 


Arg 


Asp 


Ile 


Arg 


Lys 


His 


He 


Asp 


Pro 


Leu 


Thr 


vjiy 


Cys 










80 










85 










Leu 


Arg 


Tyr 


Cys 


Thr 


Pro 


Arg 


Gly 


Arg 


Phe 


Val 


His 


Val 


Pro 


Pro 


Gin 








95 










100 










1UD 


Leu 


Pro 


Cys 


Ser 


Asp 


Trp 


Ala 


Asn 


Asp 


Phe 


Gly 


Lys 


Pro 


Trp 










110 










115 










120 


Trp 


Gin 


Gly 


Ser 


Tyr 


Tyr 


Glu 


Val 


Gly Arg 


Leu 


Ser 


Ala 


Lys 


Thr 










125 










130 
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Arg 


Ser 


He 


Arg 


He 


T 1 ~ 
J — Lc 


Asn 


x nr 


Leu 


Thr 


Ser 


Gin 
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His 


Thr 










140 










145 
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Leu 


Glu 


Val 


Gly 


Val 


Leu 


Glu 


Ser 


He 


Trp 


Glu 


He 


Leu 


His 


Arg 










155 










160 










165 


Tyr 


Leu 


Pro 


Tyr 


Asn 


Ser 


His 


Ala 


Ala 


Ser 


Tyr 


Thr 


Trp 


Lys 


Tyr 










170 










175 




180 


Glu 


Gly 


Lys 


Asn 


Leu 


Asn 


Met 


Asp 


Phe 


Thr 


Leu 


Glu 


ril ii 


Asn 


Gly 










185 










190 










195 


lie 


Arg 


Asp 


Glu 


Glu 


Glu 


Glu 


Phe 


Asp 


Tyr 


Leu 


Ser 


Met 


Asp 


Gly 
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205 








iilO 


Thr 


Leu 


His 


Thr 


Pro 


Ala 


He 


Leu 


Leu 


Tyr 


Phe 


Asn 


sp 


Asp 


Leu 
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Thr 


Glu 
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<210> 39 
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<213> Homo sapiens 






















<220> 




























<221> misc_feature 






















<223> Incyte ID 


No: 


3986488CD1 
















<400> 39 


























Met 


Ala 


Ala 


Thr 


He 


Met 


He 


Leu 


Tyr 


Val 


Ser 


Lys 


Leu 


Asn 


T ire? 

Lys 


1 








5 










10 










15 


He 


lie 


His 


Phe 


Pro 


Asp 


Phe 


Asp 


Lys 


Lys 


He 


Pro 


Val 


Liys 


Leu 










20 










25 








30 


Phe 


Pro 


Leu 


Pro 


Leu 


Leu 


Tyr 


Val 


Gly Asn 


His 


He 


Ser 


Gly 


Leu 










35 










40 








45 


Ser 


Ser 


Thr 


Ser 


Lys 


Leu 


Ser 


Leu 


Pro 


Met 


Phe 


Thr 


Val 


Leu 


Arg 










50 










55 










60 


Lys 


Phe 


Thr 


He 


Pro 


Leu 


Thr 


Leu 


Leu 


Leu 


Glu 


Thr 


He 


He 


Leu 


Gly 








65 










70 










75 


Lys 


Gin 


Tyr 


Ser 


Leu 


Asn 


He 


He 


Leu 


Ser 


Val 


Phe 


Ala 


He 










80 










85 










90 


He 


Leu 


Gly 


Ala 


Phe 


He 


Ala 


Ala 


Gly 


Ser 


Asp 


Leu 


Ala 


Phe 


Asn 










95 










100 










105 


Leu 


Glu 


Gly 


Tyr 


He 


Phe 


Val 


Phe 


Leu 


Asn 


Asp 


He 


Phe 


Thr 


Ala 










110 










115 








120 


Ala 


Asn 


Gly 


Val 


Tyr 


Thr 


Lys 


Gin 


Lys 


Met 


Asp 


Pro 


Lys 


Glu 


Leu 










125 










130 








135 


Gly 


Lys 


Tyr 


Gly 


Val 


Leu 


Phe 


Tyr 


Asn 


Ala 


Cys 


Phe 


Met 


He 


He 










140 










145 










150 


Pro 


Thr 


Leu 


He 


He 


Ser 


Val 


Ser 


Thr 


Gly 


Asp 


Leu 


Gin 


Gin 


Ala 
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155 








Thr 


Glu 


Phe 


Asn 


VJJLll 

170 


Trp 


Lys 


Asn 


Leu. 


Leu 


Ser 


Cys 


rue 
J- o _> 


Leu 




riie 


Leu 




Q <=k >- 


lyr 


Tyr 
200 


Asn 


Cor 


/\j.a 


Ala 


He 


Lys 


Asn 


Val 
215 


Ser 


Val 


Ala 


Gly 


Asp 


Tyr 


He 


Phe 
230 


Ser 


Leu 


Leu 


Cys 


Met 


Ala 


Gly 


Gly 
245 


Leu 


Arg 


Tyr 


Gin 


Leu 


Lys 


Pro 


Lys 


Pro 


Val 


Gly 










260 






Leu 


Lys 


Ser 
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<211> 206 
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<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 4378816CD1 

<400> 40 



Met 


Gly 


Ala 


Glu 


Trp 


Glu 


Leu 


Gly 


1 








5 








Leu 


Leu 


Cys 


Ala 


Ala 
20 


Leu 


Leu 


Ala 


Arg 


Leu 


Gly 


Arg 


Gly 
35 


Gin 


Gly 


Ala 


Trp 


Leu 


Cys 


Tyr 


Asp 
50 


Ala 


Leu 


Val 


Phe 


Val 


Tyr 


Leu 


Ser 


Leu 


Val 


Gly 










65 






Leu 


He 


Ala 


Ser 


Leu 
80 


Trp 


Lys 


Glu 


Trp 


Val 


Tyr 


Phe 


Asp 
95 


Pro 


Thr 


He 


Val 


Ala 


Leu 


Asp 


Gly 
110 


Ser 


Leu 


Ala 


Val 


Lys 


Glu 


Lys 


Tyr 
125 


Tyr 


Arg 


His 


Val 


Cys 


Glu 


Leu 


Tyr 
140 


Gly 


Cys 


Trp 


Leu 


Thr 


Arg 


Ser 


Pro 
155 


Asn 


Leu 


Asn 


Trp 


Leu 


Tyr 


Leu 


Phe 
170 


Phe 


Phe 


Asn 


Gly 


Leu 


Leu 


Leu 


Trp 
185 


Gin 


Ser 


Trp 


Gin 


Lys 


Glu 


Thr 


Ser 


Ser 


Val 


Lys 



200 
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<211> 235 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 4797137CD1 
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Tyr 


He 


Gly 


He 


Leu 


He 


Gly 




220 
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Phe 


Val 
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Leu 


Asn 


He 
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Ser 


Phe 


Leu 


Thr 


Leu 


Ser 


Ser 
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Glu 


Glu 


Asn 


He 


Cys 


Leu 


Asp 
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270 
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Glu 
10 


Ala 
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Gly 


Ser 


Leu 
15 


Ala 


Gly 
25 


Cys 


Ala 


Leu 


Gly 


Leu 
30 


Ala 
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Ala 


Leu 
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45 
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Phe 
55 


Ala 


Leu 
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60 


Asn 


Val 
70 
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Asn 
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Gly 
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Gly 
85 


Lys 


Ala 


Asp 
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Val 
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Leu 
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Glu 
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<400> 41 

Met Gin Gin Arg Gly 
1 5 
Pro Leu Leu Gly Val 



Ala Ala Gly Ser Arg 
10 

Leu Phe Phe Gin Gly 



Gly Cys Ala Leu Phe 
15 

Val Tyr He Val Phe 
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<223> Incyte ID No: 5470806CD1 
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<400> 43 

Met Val His Leu Thr 
1 5 
Trp Gly Lys Val Asn 
20 

Ser Leu Leu lie Val 
35 

Phe Gly Asp Leu Ser 
50 

Val Lys Ala His Gly 
65 

Leu Lys His Leu Asp 
80 

Glu Leu His Cys Asp 
95 

Leu Leu Gly Asn Met 
110 

Lys Glu Phe Thr Pro 
125 

Ala Gly Val Ala Ser 
140 

<210> 44 
<211> 2701 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 264114CB1 



Asp Ala 
Pro Val 
Tyr Pro 
Ser Val 
Glu Lys 
Asn Leu 
Lys Leu 
lie Val 
Ser Ala 
Ala Leu 



Glu Lys 
Glu He 
Trp Thr 
Ser Ala 

Val He 

i 

Lys Gly 
His Val 
lie Met 
Gin Ala 
Ala His 



Ala 

10 
Gly 

25 
Gin 

40 
He 

55 
Asn 

70 
Thr 

85 
Asp 
100 
Met 
115 
Ala 
130 
Lys 
145 



Thr 
Ala 
Arg 
Met 
Ala 
Phe 
Pro 
Gly 
Phe 
Tyr 



Val 
Glu 
Tyr* 
Gly 
Phe 
Ala 
Glu 
His 
Gin 
His 



Asn 
Ser 
Phe 
Asn 
Asp 
Ser 
Asn 
His 
Lvs 



Gly 
Leu 
Ser 
Pro 
Asp 
Leu 
Phe 
Leu 
Val 



Leu 

15 
Ala 

30 
Lys 

45 
Gin 

60 
Gly 

75 
Ser 

90 
Arg 
105 
Gly 
120 
Val 
135 



<400> 44 

gcggcggcgc 

gagggccacg 

agcggatccc 

ggatgttgcc 

gcagatttga 

caggatgacc 

gtcagtcttg 

tattcacttg 

tggaatacag 

ttgttggaac 

gttgagggca 

agggccatgt 

gtcgatgctg 

aatgacagca 

ttacaacgtt 

ttgagttttg 

gagcacaagt 

ctgatagata 

gctgctatgc 

cttagccaaa 

ctgccaccca 

ttcaaaccaa 

gaaaatcctt 

aagttatgta 

gatatctaca 

acaagcaaac 

caaaatacct 

tgctgtgaag 

aggaccgtag 

tgtgcttggc 

aacctcatgt 

actagtaggt 

ttgcatattg 

gtaactgtgg 

gctaagaggt 

tttatgcgag 

gaacttgacc 



cagcttcctc 
cccccgacgc 
acacctccgg 
acaaaatcta 
gcacaatgtc 
taggagtgat 
gtactgaatg 
accttccatc 
tcatgtccac 
agttgaaact 
ctgaattccc 
ttatgagtgg 
ccaccttaca 
ctgtagaaca 
gtcgagaata 
ctgatctctt 
tcactgctgt 
ttctcagtag 
tgtggctaga 
tcagaattga 
atgataagtc 
gacttgggat 
gtagtcttta 
gcccaccagc 
tagcaggggg 
ttcagactgc 
ggtttccaaa 
gctatatcta 
aaagatacga 
aatggagtgc 
actgttattt 
cctttgcttc 
ctaccaattc 
aaatttatga 
actctgaccc 
aaacccactt 
ggtggtctct 



ggccggaggg 
cgcgccggag 
accgagggac 
cctcgtgtat 
tgcagactgt 
tctaagtctg 
ctgcagataa 
agtaagacgg 
tcaagacgag 
gttttatgaa 
ttgtcataag 
actaagtgaa 
gataataata 
gctttatgaa 
tttaattaaa 
cagttgtgag 
gtatcatcag 
tgacaattta 
gtataacaca 
tgcactttca 
agtggtggtt 
gactaaagag 
ctcttctgtc 
tgatttgcat 
tcaagttcct 
cttcagaact 
gaccccaatg 
tgcaattgga 
cactgagaaa 
agcagttgtg 
tccaaggtct 
agctgcagct 
cggca taaga 
tgtgaataaa 
ctgtgttaga 
aaacgagcga 
gcggcagcat 



gaggcgagac 
gggcccagtg 
gcggttactc 
ttttctcttt 
gttgaaaaac 
tgtaacaaga 
cagcaagtag 
acttttctaa 
aggcagatca 
cagcagttgt 
atggttcttg 
agcaaacaaa 
acttatgcat 
acagcttgct 
aaaataaatg 
gaattaaaac 
gacgcgttca 
aatgtagaaa 
gaatcacgat 
gaagtaacac 
caaggtctgt 
gaaatgatga 
tgttacagcc 
aaggttggga 
ctgaaaaaca 
gtgaattgct 
ctttttgtcc 
ggagatagcg 
gatgagtgga 
gtccatgact 
gactcatggg 
tttggtgata 
ctcccctctg 
aacgagtgga 
gctgttgtga 
gctaaatacg 
atatctgaac 



cccaggcgag 
tggacggggc 
cacaggatcc 
cactcatgag 
tctgaagaac 
tattactcat 
gttctccttt 
tctgttcctg 
atactgaata 
ttactgacat 
caacatgtag 
cccatgtaca 
acacgggtaa 
tcctacaggt 
cagagaattg 
agagtgctaa 
tgcagctgtc 
aggaagaaac 
cccagtattt 
agagagcttg 
ataagtccat 
ttttcattga 
cccaagcaga 
ccgttgtaac 
caaaaacaaa 
tttattggtt 
gcataaagcc 
taggtggaga 
cgatggtaag 
gcatttatgt 
tagaaatggc 
aaattttcta 
gcactgtaga 
aaatggcagc 
tctcaaattc 
tcacctacca 
gtgtactgtg 



gccgcggcgg 
caccggctgg 
gctgaacata 
ctgcacaatt 
ctaattaaca 
tagtgaatgt 
atttctgaag 
gagatattaa 
tgctgtgtca 
agtgttaatt 
ctcttatttc 
cctgaggaat 
cttggcaatg 
agaagatgtg 
tgtacgattg 
aagaatggtg 
acatgaccta 
cgttcgagaa 
gtcttctgtt 
gtttcaaggt 
gcccaagttt 
agcatcttca 
aaaagtttac 
tcctgataat 
tcacagtaaa 
tgatgcacag 
atctttggtt 
acttaatcgg 
ccctttacct 
gatgacactg 
catgagacag 
tattggaggg 
tgggtcttca 
caacatccct 
tctatgtgtg 
atatgacctg 
ggacttgggg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

I860 

1920 

1980 

2040 

2100 

2160 

2220 
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agagattttc 
aaaccaccaa 
atggttgcac 
ctttgtcatt 
aattctatct 
tatatcagtg 
ttgagaaact 
ccctgtgtat 
t 



gatgcactgt 
cttatctttt 
taccacctgt 
ttctttgcta 
ttgataaatt 
agtcttacag 
ttatgtgtaa 
gtacagttca 



ggggaaactc 
ttcaacggat 
atagtgggga 
aacaaaagag 
ttatttttat 
aaagatatgc 
tcatgagagt 
aaaagttcat 



<210> 45 

<211> 736 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1455669CB1 



tatccatcct 
gggacagaag 
agttcaggga 
gctatgaaag 
gccctactta 
ttccataata 
ataagaatct 
ttataaaagt 



gccttgaaga 
agtttgaact 
gtgcacgcct 
aactaaatat 
atatttigcat 
tgaaatagat 
ggattatcta 
agtttcctgt 



gtctccatgg 
ggatggagaa 
gagttatgtg 
gagtacataa 
cagtataata 
tattcaataa 
acattgttag 
tcctagttga 



2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2701 



<400> 45 

gagacttagc 

ctgctgagcc 

ccagcctgct 

tctgggtcct 

ggatgtcccc 

gcaatgttcg 

tcctatcggg 

gaggtcaact 

catgcgcaaa 

ggcattttcc 

ttaggatgaa 

ataggacaat 

tccaagtgtg 



gacagacaga 
ctgcgcgggg 
gagcctgtgc 
tggtcccatc 
actcttcaaa 
ggggaggcag 
gtgtagaagg 
cagtggaaca 
catacgcata 
aaaacatacc 
ccttttaaaa 
ttttatatgt 
aaaaaa 



cgctgggacc 
cagggggtct 
ttcctgagaa 
tacctcctct 
gccaagatgg 
gggacaggct 
gaatctgtta 
cggttctccc 
cactcccatg 
atgtagactc 
agttatgcat 
gatatgtatt 



cacgacgaca 
ggaaccagac 
cagcaggggt 
tcatccacca 
tagctgccat 
ggaacctggt 
atcaaacaga 
aaacagattt 
ttcctggaca 
ttggatacac 
ttatttttat 
taccttagtg 



gaaggcgccg 
agagcctgaa 
ctgggtaccc 
ccatggccgg 
ccctgggagc 
gaagtcttaa 
gcaatattag 
tgtaattccg 
gtttatagct 
aaggtaattt 
gttcccccac 
tgttaaataa 



atggccgcgc 
cctgccgcca 
cccatgtacc 
ggctacctcc 
ctggaaccag 
agtagactcc 
aaaggctaca 
aaaatccacg 
accataacct 
tagagccaca 
tggctgtatt 
acactggcat 



<210> 46 

<211> 1826 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2084989CB1 



<400> 46 

ccttaggcgc 

ccccagttct 

gttcccctcc 

attgccaccc 

gggacgtgag 

cccgtgccca 

acgcgtgacg 

aaggcggttc 

catcaacatc 

tttcactgtt 

gaacatcgtg 

gtggagggcc 

cctgattggc 

cacattctac 

tgccattgtt 

gacagcctat 

caccaagcac 

caactgcatc 

tgatgaggca 

ggtggtgatt 
ggacactctg 
atgggcgatg 
tctaagactt 
agcaagaatg 



cagggacagc 
ggttctgacg 
ggctcgaacc 
ctccctgggc 
cctgagtttg 
ccgggtgggc 
tcccgcgtga 
tgagagcttc 
caggaacctc 
actgatcctc 
cagaactaca 
aagtatgtgt 
cgcatgtcag 
aggaagaccc 
aactactcca 
gtgagtgcca 
ctgcccccct 
aacatccccc 
ggtcagaggc 
tcaagaatct 
gagaagaaag 
gctgggagaa 
gccagccctt 
aactgggctg 



cgagcgttac 
ccctagctca 
acctcttctc 
acccttgagg 
ctgcaggcgt 
gcggccggga 
tggctgggag 
agagagcgat 
gctgggacca 
gaaatctgct 
gggccggcgt 
atgactccgc 
cccaggtgcc 
caaccgtggt 
accgcagtgg 
ccactggagc 
tggtcggcag 
tgatgaggca 
ttggc tactc 
gcacggcgat 
acctcctgaa 
gtgaccaggc 
cccccgaccc 
gggtgaagga 



ctggtcccgg 
t tccgcaaat 
tgagccgagc 
cctccgtgga 
gctctgtgtg 
agctcctgcc 
ggcccggcgg 
ggaaagcaaa 
aagtactttc 
gctgtccggg 
ggtgacccca 
cttccatccg 
ca tgaacatg 
gttc tggcag 
tgacactccc 
tgtggccacg 
at ttgtgccc 
gagagagctg 
ggtgactgca 
tcctgccatg 
ggtaggcgac 
cccaactctc 
ctgcaccgcc 
actc tgcagg 



gcagcggagt 
ttagggcttg 
cagctaccgg 
gggacgtcac 
gtggctgggt 
cctccctgct 
cgacagcgga 
atgggtgaat 
ctgggcagag 
gcacagctgg 
gggatcaccg 
gacacagggg 
accatcactg 
tgggtgaatc 
atcactgtga 
gccctgggac 
t ttgcagcag 
caggtgggca 
gccaagcagg 
gccatcccac 
tgtacctctc 
tctccagcct 
tcctccacct 
ggcaggagga 



tctttaccca 
ggtctggctt 
ggctcctgga 
ggggcagagc 
tctgccaatc 
ggtcggcgtc 
ggcagagagg 
tgcctttaga 
cccggcactt 
aagcttctcg 
aggaccagct 
agaaggtggt 
gctgcatgct 
agtccttcaa 
ggcagctggg 
tcaaatccct 
tggcagc tgc 
tcccggtggc 
gaatcttcca 
cactgatcat 
ttgtcctgga 
cgcctgattc 
tcgttcattc 
gaggacaaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

736 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 



37/60 



BNSDOCID: <WO 0078953A2 I > 



WO 00/78953 



PCT/US00/16668 



gaaggaaacc 
catagtcatg 
tacacacggg 
caagctctgg 
gcatgtttta 
aagtttatag 
agaaagtgta 



aacttcatca 
cttgatctca 
tgaacacgca 
ctcattcaca 
gggctgagga 
tctgatgcag 
aaaataaaaa 



gtgttactcc 
tgctcattct 
cacacagaca 
gaatcattta 
gatggcactg 
aaaccaataa 
aaaaaa 



agtggcttct 
tttaccccct 
tgaacaggac 
ttcacaaatg 
aacacaatgg 
acaaggaggc 



gacacacaga 
agtgcctcca 
acgaaagcaa 
tattgagtgc 
ttatggcccc 
acccacataa 



aggggactgt 
tactgagagg 
agcacaggaa 
catgcaccag 
tgtcc tcatg 
atacattctt 



1500 
1560 
1620 
1680 
1740 
1800 
1826 



<210> 47 

<211> 1325 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__f eature 

<223> Incyte ID No; 2501034CB1 



^ a n rw a n 

*» \J ~T I 

ccacgggtcc 
tgcagcaggg 
cctgccagta 
accacggtgt 
tcgagtatga 
gacatattct 
tttgtgaagc 
atgacttgat 
tagaacaaac 
cggcaaaaaa 
gaaaactctt 
aactcattaa 
tcaaccgtaa 
tctacttctt 
aggacttctg 
agagaaaaga 
gattggaaac 
aaatgcaccc 
ccacatggac 
atacgttata 
ttcattcctt 
tttgtgttct 
aaaaa 



ggttctggac 
gaaggggaac 
caggtgcatc 
gctgaagata 
tcctgttatc 
tttcaatact 
atacctgagg 
cagatatgct 
tgcctatttt 
aaagcccaac 
gctgattcca 
agtatttgtt 
gctctccttc 
ccagagtatg 
aagagtttta 
catgaaatat 
actttctgaa 
aagagacaag 
acaaggaatg 
cacaaggcca 
tgacatgttt 
tacaatagaa 



tgcagttgag 
gtggatgggg 
gaatgcaacc 
accatctgta 
atcttgatta 
caaataaata 
tggtggcagc 
aaggaatggg 
attggcattt 
ttcattttgc 
gctgtcattt 
cttacatcaa 
ttggccgtgt 
gaatgggatg 
ttcttcttca 
aaaccaacct 
aaagaaagca 
ccatttacat 
ttgctgcaga 
ggtaagcgct 
atatcttttt 
acgcttttaa 



tggaaatggg 
tggcagcgac 
aggaggccaa 
aatcctgcca 
atgctatatt 
tccatggaaa 
ttcaagattc 
atttctatag 
ttaccttcct 
tgctgaaagc 
gggaacatga 
attttcaggc 
tgagtggctt 
ttggaagtga 
ctatctgtgg 
cctcatttct 
atgataatag 
acagatattc 
gactgaatga 
cataattcac 
aatttaaatg 
taaagtcttt 



caacggcggg 
tcctactgct 
agagttgtac 
gaaacctgta 
gtgcaaagct 
actctgcata 
caaccagaat 
aatgtttgcg 
gtgggtagaa 
attattatta 
ctacacatct 
aattagagtg 
actgctggaa 
ttatgccatc 
catgaccagc 
gttgagtaaa 
cggtggatac 
acagtcacac 
catgcaacag 
acataataaa 
ttgttactgg 
cagaataaac 



cggagcggcc 
gcctcggcct 
cgagactata 
gacaaatata 
caggcctaca 
ttttgtttgc 
actgcccctg 
attgctgctt 
cggcccatga 
tctagctacg 
gtgtgcctca 
accctaaaca 
agcatcatgg 
tttaaatctc 
tgtatctgaa 
atgaagcaaa 
ccacccccac 
atagaaacac 
gtgaaggttt 
acatc taggt 
cttaaaatat 
caaaaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1325 



<210> 48 

<211> 1832 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2745212CB1 



<400> 48 

tgggctgtcg 

gctggcaaat 

gttgggtggg 

cgaatcactt 

gagtgctgtg 

aggacaaaca 

gggttccgga 

ttcaaaaagc 

agaagacatc 

aaacacctgg 

aaagggcccg 

ttaaagtaaa 

ccaggcaaaa 

attatgcaaa 

aagttgtggc 

aatttgagag 



ttggctggag 
ccggcccagg 
gtagagagta 
ataaatggcg 
ctttcatggg 
agtgaaatac 
gagcagagta 
caatcaggag 
tggtctgcaa 
aaatggagat 
ggtagatcct 
gattcctgaa 
acagctctt t 
ttacaagaaa 
agggataaaa 
accacagtat 



cagcggctgc 
atgtagagct 
gggggcggta 
ccgaagcagg 
cctcttcttt 
ttcatacatt 
c tcaaatacg 
cagtatgcag 
cagaaaaatg 
ggtggcagta 
actgttgaaa 
gagctaaaac 
tatcttcctg 
tctcgtggaa 
gaatacttca 
gctgaaattc 



gcgggtcgcg 
ggcagtgcct 
gtcgggggtg 
acccgaagcc 
atgaagcaaa 
acagtggttg 
tggacaccaa 
aggggaagat 
ttgaagtgaa 
ccagtgagac 
atgaggaaac 
cgtggcttgt 
ccaagaagaa 
acacagataa 
acgtaatgtt 
t tgcagatca 



gtgctgtgag 
gacggcgcgt 
gtgggagaag 
taaattccag 
gtgtgtaaag 
gaataaaaat 
tttgcagaaa 
gagaggggct 
aacgaaaaag 
ccctcagcct 
attcatgaac 
tgatgactgg 
tgtggattcc 
taaggagtat 
gggtacccag 
tcccgatgca 



gtctgcgggc 
ctgacgcgga 
gaggaggcgg 
gagggtgagc 
gttgccataa 
tgggatgaat 
cagcgagaac 
gccccaggaa 
aacaaacaga 
cctcggaaga 
agagttgaag 
gacttaatta 
attcttgagg 
gcggttaatg 
ctactctata 
cccatgtccc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 



38/60 



SNSDOC1D. <WO O078953A2 I > 



WO 00/78953 



PCT/US00/16668 



aggtgtatgg 
atacacctct 
agtacctggc 
ctgagtacca 
taaacacatt 
tgtataacaa 
taatagctcc 
tcaatgatgg 
gcttttgaat 
tgctatatga 
gaacacactc 
tcctttgcaa 
aatggtggct 
cgcaatgcat 



agcgccacat 
ggatgagaag 
aaagaattct 
tcggaaagct 
tttgttctta 
ttgatgtttg 
ttttttcttc 
acaacagagg 
gctggtggtt 
caaaatgctc 
atccatttgt 
gtcatccatg 
tgtttcatgg 
tgtgtagcta 



ctcctgagat 
agccttgctt 
gcaactttgt 
gtgtgagagg 
gtctatctct 
ttttctgttt 
tttctttttt 
gatatgctgt 
ctattccttt 
tgattcctag 
gcttttgttt 
ttgttactta 
tttttgtatt 
gttt tctgga 



atcttcaata aattttttct ttaaatttca 



tatttgtacg 
tattactcaa 
tcagtgccag 
cactctcact 
tgtacaaacg 
gattttaaac 
tttttcattt 
agagtgtttt 
gacactacgc 
tgccaaaggt 
ttttttatgg 
ggcattttat 
tgtgtctaat 
aaag tcaatc 
aa 



aattggagca 
ttatcttcac 
cgattatgaa 
cacttatgtt 
atgtgctttg 
agagaaaaaa 
caaaattgct 
attgcctagt 
acttttataa 
tcaattcagt 
tgcttaaagt 
cttggctcaa 
gcacgtttta 
jt tt t agga a t _ 



atgttggctt 
gatttcctaa 
gtggctcctc 
tggatctccg 
aagatgttag 
taaaaggggg 
gccagtgttt 
tgacaaagct 
tacatgttaa 
gtatataact 
aaagagccca 
attgttgaag 
acatgataga 
_tg.ttt.ttcag- 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
.1800- 
1832 



<210> 49 

<211> 1211 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 4833111CB1 



<400> 49 

gacagacccg cctcaaacat ggcggcgccc agcgcgcgag gacgtgatcc gcttctgctc 60 

cggcttggat tgtagccttg acgaggtctg agcgaccatg gaccggccgg ggttcgtggc 120 

agcgctggtg gctggtgggg tagcaggtgt ttctgttgac ttgatattat ttcctctgga 180 

taccattaaa accaggctgc agagtcccca aggatttagt aaggctggtg gttttcatgg 240 

aatatatgct ggcgttcctt ctgctgctat tggatccttt cctaatgctg ctgcattttt 300 

tatcacctat gaatatgtga agtggttttt gcatgctgat tcatcttcgt atttgacacc 3 60 

tatgaaacat atgttggctg cctctgctgg agaagtggtt gcctgcctga ttcgagttcc 420 

atctgaagtg gttaagcaga gggcacaggt atctgcttct acaagaacat ttcagatttt 480 

ctctaacatc ttatatgaag agggtatcca agggttgtat cgaggctata aaagcacagt 540 

tttaagagag attccttttt ctttggtcca gtttccctta tgggagtcct taaaagccct 600 

ctggtcctgg aggcaggatc atgtggtgga ttcttggcag tcagcagtct gtggagcttt 660 

tgcaggtgga tttgccgctg cagtcaccac ccctctagac gtggcaaaga caagaattac 720 

gctggcaaag gctggctcca gcactgctga tgggaatgtg ctctctgtcc tgcatggggt 780 

ctggcggtca caggggctgg caggattatt tgcaggtgtc ttccctcgaa tggcagccat 840 

cagtctggga ggtttcatct ttctgggggc ttatgaccga acgcacagct tgctgttgga 900 

agttggcaga aagagtcctt gaagcagaga caagcctcac ctccacttct gtcaagagag 960 

gggcctgcag tgcaaaccct cttccgctga gcagctgtct gaactatagg ccccagtgct 1020 

gaagaccagt tgtgctaaga taccggcatg gagattgtgc catccgtggt ataggctggc 1080 

tggtatgaag tcattggcct gtatgccaga gagctaagag aagaaaacgg ggtctgtggc 1140 

ggtactctga acaatttcct cagaacctct taataaataa gtttggtaat gctgagaaaa 1200 

aaaaaaaaaa a 1211 



<210> 50 

<211> 1046 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 876677CB1 



<400> 50 

cccacgcgtc 

acatctgagt 

ataaaagctt 

agagaatttc 

cctctctttc 

cattaaggag 

caaggggcca 

ccgaacagtt 

tcaggtagac 

caacgatgtc 



cgggaatgtc 
tgctgcctgt 
ctcaagaatg 
gtgggtgtca 
ctgttggtga 
tatgaacgtg 
ggtttgatcc 
acttgcaaca 
ggagttgtc t 
catcaagcaa 



tttcactctc 
tcttcacact 
agatggattc 
acaataaacg 
tcattacctt 
ctgttgtatt 
tggtcctgcc 
ttcctccaca 
attacagaat 
catttctgcc 



tcatactttc 
tagctccaaa 
tagggtgtct 
gcttggtgta 
ccccatctcc 
ccgtccggga 
atgcatagat 
agagatcctc 
ctatagtgct 
ggcccaaacc 



tcctctcccc 
cccatgaaaa 
tcacctgaga 
tgtggc tgga 
atatggatgc 
cgcatccaag 
gtgtttgtca 
accagagact 
gtctcagcag 
actctgagaa 



tctcccaagc 
attgccaagt 
agcaagataa 
tcctgttttc 
gcttgaagat 
ctgacaaagc 
aagttgacct 
ccgtaactac 
tggctaatgt 
atgtcttagg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 



39/60 



Bi.'SOOCID: <WO 0078953A2 l > 



WO 00/78953 



PCT/US00/16668 



gacacagacc ttgtcccaga tcttagctgg acgagaagag atcgcccata gcatccagac 660 

tttacttgat gatgccaccg aactgtgggg gatccgggtg gcccgagtgg aaatcaaaga 720 

tgttcggatt cccgtgcagt tgcagagatc catggcagcc gaggctgagg ccacccggga 7 80 

agcgagagcc aaggtccttg cagctgaagg agaaatgaat gcttccaaat ccctgaagtc 840 

agcctccatg gtgctggctg agtctcccat agctctccag ctgcgctacc tgcagacctt 900 

gagcacggta gccaccgaga agaattctac gattgtgttt cctctgccca tgaatatact 960 

agagggcatt ggtggcgtca gctatgataa ccacaagaag cttccaaata aagcctgagg 1020 

tcctcttgcg gtagtcagct attgca 1046 



<210> 51 
<211> 1660 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2326143CB1 



<400> 51 

gctcccctgc 

cggcgcgggc 

gccgccaccg 

aaaggaaatt 

gcatcttgtt 

tgcatattaa 

cagccacagt 

caagaacatg 

gcagcgttac 

cttgaacaca 

atgtttcacc 

actagcagtt 

gaagccatgg 

ggaggacagc 

ctgcagctag 

gcaacccggc 

accaacatag 

acaaggttga 

gcagaccgca 

tcatcctcag 

gatattatgc 

aatgagcctc 

ctgtatttgc 

caggtctgtc 

taatcatttc 

ggtgatctct 

actgaagagg 

aataaaaact 



ccaccccgcc 
agccgtgacc 
tctgaactag 
ttcgaggtcg 
atgaaagtgg 
caagggtaga 
cttttacttg 
ttacttctga 
ctggaggcga 
gagcccctag 
ctggccgggg 
ctactggtgg 
atcctatagc 
ttaattcctc 
aacggcagca 
gtactaacac 
ctaatacaga 
atgatcctaa 
gcctgtttgt 
atgaggatga 
ctttagatgt 
caccacctcc 
caatgaaagt 
actcttgtta 
cacttaacta 
gcttcctgta 
aaaattgaaa 
gcttggatgg 



cccgtggccg 
cccgcggggg 
gatgtcccga 
cagatataag 
tgcaacaaca 
ttttgattta 
tccctattgt 
acatgcagaa 
tcctaatcat 
agatttagat 
attaggaggt 
actttcttct 
tgagctttta 
tggcccttcc 
tgcccaggca 
aagcagtgtc 
aagcagtcag 
aatgtctgaa 
ccaagagctc 
tcggggggag 
tgctttagaa 
tctttgatga 
ggacaacaac 
cattgtgtac 
atttttactt 
ccttgacatg 
tctctaatga 
tttaccttaa 



agcccgggag 
acactgcagc 
catgaaggtg 
tgtttaattt 
acaaggcata 
tactatggtg 
ggaaaaatgg 
acatcaacag 
gtcacggatg 
gaatcgagtg 
cctcgtgctc 
tctcagagtt 
tctcagttat 
gcttctcagt 
gcacggcaac 
accactacaa 
cagactctac 
acggagcgcc 
cttctgtcca 
atggcagatt 
aacctaaatt 
catcccaatt 
tatcttgggt 
attcaaaagg 
ctagcaggta 
caaaaggctc 
agctgctgtg 
aaaaaaaaaa 



tcgagtggga 
cggagcccgg 
tcagctgtga 
gctacgatta 
caactgacca 
gggaagcttt 
gctatacgga 
aagtgatttg 
actttgcagc 
gtgttcgaca 
gtagatcaaa 
catattctcc 
caggagtgag 
tacaacaact 
aactggagac 
tcacacaatc 
agaattccca 
agtccatgga 
ctttagtgcg 
ttggtgctat 
taaaagagag 
cgcagacaat 
ttgtttggtg 
aagagagaaa 
aatgtaggta 
tcctaatact 
tgtatttatg 



gtcggccggc 
gaggggccgc 
tgcatgttta 
cgatctttgt 
cccaatgcag 
ctctgtagag 
gacatctctt 
tccaatatgt 
tcatcttaca 
tgtacgtaga 
catgcacttt 
aagcaatagg 
acgttctgca 
gcagatgcag 
cgcacgcaac 
cacagcaaca 
gtttctttta 
aagcgagcgt 
tgaagagagc 
gggctgtgta 
taataaagga 
gtcctctgtg 
attgtaattt 
atatatatga 
gcagtgcagg 
ccacattcaa 
aatattaatg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1660 



<210> 52 
<211> 1110 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc^feature 

<223> Incyte ID No: 2786302CB1 



<400> 52 

cctttattag 

cagtttcttc 

ggataaagaa 

tttagatttg 

ttttaaaaaa 

aacctttctt 

tcaaatcttc 

ttgagtgtga 

cttcagagtg 

acaatgaata 



agaaagagtg 
tgtcttctgc 
taactctagg 
gctttaaaag 
attaatgtgg 
ccacatgaga 
aagaaaatga 
atgccatggt 
ttattttcct 
aagacccagt 



acctatctac 
tcctggacat 
acaagagttg 
cccatactat 
gctttttaat 
agtattttac 
tcaaatcttc 
gatgctaatt 
ttttgtcata 
t ttcagaatc 



cccttacttg 
ggcatgaaaa 
ctttggggct 
tttattagct 
tctttctcag 
aagttgcttg 
tgtgacaaaa 
ctgattattt 
aggtgtctgg 
tctccacgga 



gggaagactc 
cggaagtttg 
aaacctatct 
cttctatgct 
tgggagggtg 
tcaaatttaa 
aaatggacaa 
ttcttttctt 
aaatgaagta 
gtagagaaac 



tctgcttcaa 
agtatttcaa 
cctgtattca 
1 1 1 taaaag t 
agtttgaata 
aagaaaatga 
atattcacca 
ttagccatac 
tggaaatgaa 
tcatcccaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 



40/60 



BNSDOCID; <VJO 0078953A2 t > 



WO 00/78953 



PCT/US00/16668 



ccggaagagc 
gcactcactg 
gaatattatg 
aatgtttcct 
ggtaaaagta 
gtgttacaag 
gggtactgcc 
ttgtatgcag 
ggtgggaggt 



ccgaagaaga 
ctccaaactt 
agacaaagaa 
tttgtgttaa 
cttccattaa 
gcagcagagc 
ctcaggagaa 
ctgtggaaag 
cttaagctcc 



agatgaagat 
ggaggaggaa 
aagttgcttt 
aaaaggtgaa 
aatgataact 
atcagtaagg 
ctcactgtgg 
actggggcaa 
aggaacaact 



gttcaagctg 
ccagtcataa 
tcaacaagaa 
gttttgggat 
gggtgcacaa 
caacagcatg 
cccaagctta 
aaaagatgct 



aaagagtcca 
ctgcaagctg 
agaagaaaat 
tactaggaca 
agccaactgc 
acaacagcct 
ccatgaaaga 
gctctcagta 



agcagcaaat 
tttacacaag 
agccatcaga 
caatggagct 
aggagtggtg 
caagttcttg 
gcacttggag 
tttcacgatt 



660 

720 

780 

840 

900 

960 

1020 

1080 

1110 



<210> 53 

<211> 1120 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mi sc_ feature 

<223> Incyte ID No: 3735780CB1 



<400> 53 

gaaatccagt tatcaaaatt gactcaagaa gagagaacct aacagaacaa taacaatgga 60 

agaaattggg aacattatca caaagctatc atcctgccaa actccaggct cagatgtcac 120 

aggttaaaaa aagtccttca tgaaaaagaa agatcttaag cagcatgatg gattcagaag 180 

ctcatgaaaa gaggccacca atactaacat cttcaaaaca agatatatca cctcatatta 240 

caaatgttgg tgagatgaag cattacttgt gtggctgctg tgcagccttc aacaatgtcg 3 00 

caatcacatt tcccattcag aaggtcctct ttcgacaaca gctgtatggc atcaaaaccc 3 60 

gggatgcaat acttcagttg agaagggatg gatttcgaaa tttgtatcgt ggaatccttc 420 

ccccattgat gcagaagaca actacgcttg cacttatgtt tggtctgtat gaggatttat 480 

cctgccttct ccacaagcat gtcagtgctc cagagtttgc aaccagtggc gtggcggcag 540 

tgcttgcagg gacaacagaa gcaattttca ctccactgga aagagttcag acattgcttc 600 

aagaccacaa acatcatgac aaatttacca acacttacca ggctttcaag gcactgaaat 660 

gtcatggaat tggagagtat tatcgaggct tggtgcccat tcttttccgg aatggactca 720 

gcaatgtctt gtttttcggc cttcgaggtc ccattaagga gcatctgcct accgcaacga 780 

ctcacagtgc tcatctggtc aatgatttta tctgtggagg tctattgggt gccatgttgg 840 

gattcttgtt ttttccaatt aatgttgtaa aaactcgcat acagtctcag attggtgggg 900 

aatttcagtc tttccccaag gttttccaaa aaatctggct ggaacgggac agaaaactga 960 

taaatctttt cagaggtgcc catctgaatt accatcggtc cctcatctct tggggcataa 1020 

tcaatgcaac ttatgagttc ttgttaaagg ttatatgaaa aaaccatcag ttaagtgcca 1080 

tttatcaact gaatagacct tctaagaaga aaaaaaaaaa 1120 

<210> 54 

<211> 886 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 039026CB1 



<400> 54 

ggccgcggct 

ccgccagcct 

attggagaca 

gcatcagaaa 

ctgatcagat 

gctttttgga 

gttcagaaca 

aggaatatca 

cacgatagag 

tggaaatgag 

ctgtcatgac 

ataatcacaa 

ccttatgatt 

tgactgttaa 

ttactcaaag 

<210> 55 
<211> 2336 



cctgtccaga 
tgtactgatg 
ttcaagcaaa 
aggtgactaa 
aaaacagttt 
ctgtgttaaa 
ttgcttacag 
tattcagcag 
aagtcctgat 
gactcatctg 
tgtttggcaa 
tagaaggtct 
cagcagcttg 
ttttaaagca 
taaaataaat 



ccctgaccct 
tcggatgcga 
ggttggacaa 
taaaggtacc 
aaggaatttc 
gacttcacaa 
aaatatttaa 
aatgaagccc 
ggatgaactt 
atagaatccc 
atggaaaccg 
tattgttcag 
gtcacttgat 
acttatgtgt 
ggaaatatca 



ccctcccaag 
gagcctgtgc 
ctacttttcc 
agaagaatat 
tggggaccta 
caagagaagt 
aaatgacaca 
tggcagccaa 
ttgatgaaag 
ctgaaagcag 
ctggagaaac 
tgaaataata 
tagaaaaata 
tcgatcatgt 
c tgaaaaaaa 



gctcaaccgt 
ttaagtaaga 
agaacagaaa 
ggctgcacaa 
caataaactt 
aaaacctgaa 
aagaatatcc 
agcaggac tc 
attgccaaca 
tagccaccat 
aaaattgcta 
agatgcaaca 
aaccattgtt 
atgagataga 
aaaaaa 



cccccaacaa 
atcaggcctt 
ggaaactcat 
ataccagaat 
acagagacct 
gagaccacct 
atgagatttc 
cttggccaac 
gctgctttat 
gttcaaccat 
tttaccagga 
tttgt tgagg 
tcttcaattg 
aaaattttta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

886 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> mi sc_ feature 

<223> Incyte ID No: 260607CB1 



<400> 55 

taatacgctc 

tttaggatat 

gactttgaag 

gttgcctttt 

gccactttta 

ccttatcaat 

ccttgtttat 

gggggatgct 
ggctgagaga 
tctgaggagg 
ctcctggggg 
tctgtcggct 
cctgttgtgg 
ttcaggcctg 
cctcctccca 
ctcacagagc 
ctaggcaacc 
aagtgtttat 
ttcttgaaaa 
aagaaaaatc 
aatttagatg 
agtttctttc 
tcatgtctga 
cgctttgggg 
ggacaaaaga 
gtgttttgat 
cttacagtaa 
ggacaagtga 
ccaggatccc 
ctttccttgc 
gctttgttgg 
tggagtgaga 
ggggaaacca 
ggcccttttt 
ggatcctttg 
gtagttgggg 
tcctcataga 
attaggtagt 
aggaaaaaaa 



actataggga 
atttttaaac 
ctgcatctgc 
gcagtttggg 
aggacaggga 
cttttttcct 
gcctcactat 
ctagttggtt 
ggctgttcct 
aaggcctcct 
attagggtgt 
cctggttgca 
gactgctctg 
gcagacgggc 
acccagacat 
cccagagggc 
ccttcctccc 
ggggagatcc 
ccagtagaag 
ctcatcaatg 
cagaacattt 
ctttggatac 
agggggattt 
cacagtctga 
aaaaaaattt 
taaccgtagt 
gattggaact 
gttgaggctc 
ccccatatcg 
ccatgtgtca 
accagatggt 
cctggggctt 
gagctgagac 
cagtcttcaa 
ccctggggga 
gagttccacc 
agccccattt 
ggcagctcca 
tattaccttg 



atttggccct 
tctttgaaac 
cagttacacc 
ttgtggactc 
gttttatagc 
ttttcctgac 
tggcagaaaa 
ctgtgtccat 
gaatcaaagc 
ggaaactggg 
ggtgtttgcc 
ctttggggaa 
tcccctgccc 
agtgaggagg 
accctctgcc 
ctgcttgagc 
acctctcatc 
tagtggcttt 
ggaaacagca 
ccaggggaca 
ctctgtattc 
ataaggcttc 
aggggcgaga 
tggcgccttt 
tttttcttga 
gatggatgct 
aagggcaggg 
gtgccacaaa 
attgggctgg 
cccctttggc 
gaggaaagga 
ggcctggaac 
ctcttcaaca 
tgggggccca 
gcctgagggc 
gggcgatccc 
cacttttgtt 
cttgcttagg 
agaatacttt 



cgagcagtaa 
acataaccaa 
ccaaatggct 
agctcctgtg 
ccttttctac 
cccctccttc 
gaccccattt 
tttcctctgt 
aatagccagc 
agctaagggc 
aagccaaggg 
ggagaggaag 
ctactgcaga 
ggcctcagtt 
aaactgggaa 
cttcttgctc 
aggggtgggg 
gccattcaaa 
cagcctgtca 
taaaagccat 
agacttagag 
tctatcgggg 
gccgcagccc 
gctggcgcct 
atgaaatagc 
acgagtataa 
actcatgcat 
aggtttgtcc 
gagggctggc 
ctccagcttg 
atggcctctt 
ccaccacaca 
gggtttcttt 
gttggctcta 
gcggtccttg 
agcccctccc 
ttatccacct 
ttaggggggg 
ccaaaaaaca 



ttcggcacga 
aacggtttga 
ttaatcccct 
aggggtctgg 
tttcctcccc 
tggaggcagt 
aaaacccaga 
gccaaagaca 
tttcgacaca 
gaggcccttc 
gtagggagcc 
tttggggctc 
gatagcactg 
agctctcaag 
ccagcagtgc 
cacaggagaa 
gttctccttt 
ccactcgact 
cagtaattgc 
ttcccttcca 
taacaccagc 
tacgggacag 
tgaccctcgg 
tagcatggtt 
aggaagctcc 
atggattaac 
aagggtatga 
ttggggaaca 
cgtgaggtcc 
tccctctctc 
cccttctaga 
gccccaaagt 
gagatcctac 
gaaggagaag 
gactcattca 
cacccaccct 
cttagcaaaa 
aaaaagattt 
aaatttaaaa 



ggaccatctc 
ttcactgact 
ctcgggtctg 
ttaggagaga 
tcctcccagt 
tgggagctat 
gaacactgga 
gacagacaga 
tacctggctg 
cct tcagagg 
gagaaattgg 
caggtagctc 
ccgagttccc 
ggtgccttcc 
tagtaactac 
gctggtgccc 
ctttcccctg 
gtttgcctgc 
aggaagattg 
aatactcgac 
tgaaaactgc 
ggaggaggcc 
tcctgtgcac 
gactccggat 
tcgggagcat 
tacctcaatc 
atcccagcca 
ggcaggcctg 
ccactttctg 
actttctata 
gggggctggc 
caggaagcct 
acctccattg 
aggtgaagca 
ggccgtcttt 
ctaatggacc 
caatagataa 
ctttttccaa 
aaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

cnn 

\J V V 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2336 



<210> 56 
<211> 2200 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 1429651CB1 



<400> 56 

gagaacttta 

ctcgttcagg 

ggatgcgttt 

cctctttgtc 

aggctacttc 

cagcaatgtt 

ttctgtatca 

cctacccatc 

cggtggcatc 

caagatctcg 



cgcctggatc 
accatggaga 
ccccagaagg 
ctggctgttg 
ctggctggag 
ggaagtggac 
gcttatgaac 
tacactgctg 
agaatcccca 
gtagacatgt 



tcatctaact 
gcggcaccag 
gcttggagcc 
gactatggtc 
gggacatggt 
atttcattgg 
ccaatggctt 
gtcaggccac 
tcatcctggc 
atgcaggtgc 



gacacagaaa 
cagccctcag 
tggggacatc 
cacagtgaag 
gtggtggcca 
cctggcaggg 
gttttctgtg 
cacgatgcca 
tgtactctac 
catctccatc 



ccctgtaagg 
cctccacagt 
gcggtgctag 
accaaaagag 
gtgggtgcat 
tcaggtgctg 
ctgatgttgg 
gaatacctac 
ccatctacct 
cagcagtctt 



atccagaggt 
cagacccccc 
ttctgtacct 
acacagcgaa 
ccttgtctgc 
ctacgggcac 
cctggacctt 
ggaagcgcct 
acatcttcac 
cgcacctgga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
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tctgtacctg 
cctggctgct 
caccttgatg 
cttcttggcc 
tgccttccat 
tggaatgtcc 
tctggctgcc 
ggtgctgccc 
tcaagtggcc 
ttcggacatc 
gatgatggct 
cagcaccatc 
gctcatgatt 
_ccctgtggtc- 
ctacctgcag 
tgaaaagggt 
cctggacttt 
gaagagcatt 
ctccaccgtg 
gtttactcgt 
tcttaccctc 
gatggttcaa 
agtggtgaag 
ggccagagca 
ggacgtcaac 
gtgtggggtg 
aaagaaaaaa 



gccatagttg 

gtgatctaca 

ggctacagtt 

ccggctagca 

attttccgag 

atcccatccc 

aagaacctgt 

ctcctcataa 

tgtgcagatc 

gcgtatccca 

gtgatggtgg 

ttcaccatgg 

gtgggcaggg 

eaggccagcc- 

ccgcctgtgg 

gccttctggg 

atttacgtgc 

cactacctct 

agctggttca 

cacgaccccg 

tctcagaacg 

gaaaacacgt 

gccatcctgt 

gaagccatca 

cccattttct 

aacccagggg 

taataaagct 



ggctactggc 

cggatgccct 

tcgccgcggt 

accggagtga 

atccgctgac 

tctggtactg 

cccatgccaa 

tggtgttccc 

cagagatctg 

aactcgtgct 

cggctctcat 

acctctggaa 

tgtttgtgct 

agggc ggc ca~ 

cggtggtctt 

gcctgatctc 

agcctcgatg 

acttctccat 

cagagccacc 

tggtccagaa 

ggatgccaga 

ctaaaaccca 

ggccctgtgg 

tagtttccct 

gcgtgagctg 

cccaaactct 

tttgtttacc 



catcactgct 
gcagacgctg 
tggtgggatg 
gaacagcagc 
atctgatctc 
gtgcacggat 
aggaggtgct 
tgggatggtc 
ccagaagatc 
ggaactcctg 
gtcctccctc 
tcacctccgg 
gctgctggtc 



gtatacacgg 
atcatgctta 
gaaggactga 
tgcgggctgc 
ccgtggccgg 
caggtgattg 
ctgatggctg 
agccgcatcc 
tgcagcaacc 
cccacagggc 
acctccatct 
cctcgggcat 
ctggtctcca 



ttgctggtgg 
caggagcgct 
aggagaagta 
cccgggaaga 
gggtcctatt 
tccagcggac 
catacctgaa 
tcttcccaga 
cctcaggctg 
tccgtgggct 
ttaacagtgc 
ctgagaagga 
tcct.ct .qgat. _ 



gctcttcatc 
catcatggga 
gggcctgctc 
cgaccagcca 
gatcctgtcc 
ctccaaggag 
ggaacaagca 
ggccagcagc 
cagctgtgac 
aatacaggag 
ggaagaaaac 
cgccatcttt 
gtttctcttc 
aaaaaaaaaa 



tatatccagt 
tgtttctgga 
ctgggcttgg 
gatgagcgcc 
acggtcaccc 
atggtcagcc 
ccaccagcag 
agcagcagcg 
atgaccccaa 
aagggcaagg 
cccttggtga 
atctggggct 
agtgctccat 



ccatcagctc 
agaggaccaa 
ttaggctggt 
cggtcctggt 
tcatcactgt 
acctgacctg 
ctcccttgtc 
tccagttcga 
agcagtccaa 
aagagctccc 
agaccctcct 
attttgctta 
ttttttaatg 



<210> 57 

<211> 2823 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2069971CB1 



<400> 57 

gaaagacata 

taggtgaaaa 

caacaacact 

tagaaagtgt 

caacaacgac 

tatcttgtga 

aaatgaacac 

cccagagaga 

aagcccacag 

tcagcaatat 

cttgctgttg 

ctcttctccc 

cgagcagaga 

ctagtgtacc 

atgatcatcg 

cagagatgcc 

tttgtgcatg 

atctacttta 

ctggtcttgg 

gtgccaaaga 

ctgtctgtgc 

catgttgcta 

cttagcaaaa 

gaccccgact 

ccaaatt tga 

aggaaccaaa 

tccattcacc 

cagtagctgt 

aacagctgtt 

agaacacata 



cacacttcat 
caatgaagcc 
gatgtaggaa 
ataaataatt 
aacaacagcc 
atgataaagc 
ttcatagcaa 
gaccagagga 
aaaagggggc 
gcttcatttt 
tcacagatgc 
tgtggttgtc 
tccttggtgc 
tggcatgtga 
tttccagctg 
ttggccacaa 
cccttggaga 
agccagagta 
ccagcaccat 
gcccgaatta 
acagcctgca 
cagcagccag 
gcttcacgat 
gccctctctg 
caggccacct 
ggaagaaact 
acgaaggaag 
gctcaatcgc 
cgtaactatc 
ggcaaacttg 



gtaatgctac 
aggtaatatt 
gctcattatt 
gcagtgctgc 
gcagctcatc 
tgccaagatg 
tgtggaactc 
gctggagtca 
gaatgagtac 
catgattgca 
tgcccacctc 
atcgaagcct 
cctgctctcc 
gcgcctgctg 
cgcagtggcg 
tcacaaggaa 
tctatttcag 
taaaatagcc 
cactatctta 
cagtggtgtg 
catctggtct 
ccgggacagc 
gcactcaccc 
tgaagacccc 
tcaaacatgc 
cacgtcatgg 
aggcactgag 
aggaatgcgt 
ggcaatacca 
aactcaggaa 



ctgcaagtct 
gcaaggaggc 
ttaatttctg 
tttgcttcca 
ctggccgtca 
tatgctttca 
caacagaaac 
ggaggcacgt 
gcctatgcca 
gaggtcgtgg 
ttaattgacc 
ccctctaagc 
atcctgtgca 
tatcctgatt 
gccaacattg 
gtacaagcca 
agtatcagtg 
gacccaacct 
aaggacttct 
aaagagctta 
ctaacaatga 
caagtggttc 
accattcaga 
tgtgactagc 
tgctacgcag 
tgcaatgcac 
atccaccaac 
atatagatta 
aattcacctc 
agtcttacta 



ccctagaaaa 
tgtaatttta 
gagcctttta 
aaactgggca 
tggagtttct 
cactagaaag 
cggtgaataa 
accactgcca 
agtggaaact 
gtgggcacat 
tgaccagttt 
ggctgacatt 
tctgggtggt 
accagatcca 
tactaactgt 
atgccagcgt 
tgctaattag 
gcacactcat 
ccatcttact 
ttttagcagt 
atcaagtaat 
ggagagaaat 
tggaatctcc 
tcagtcacac 
tctctgcacc 
attttatcta 
caattggatt 
ttcctgagtg 
ccctccaaca 
gaaatcagtg 



gcagtttttg 
gcagacctac 
attttttctt 
gtgagttcaa 
tgaaagaacg 
aaggagctgc 
agatcagtgt 
cagtggctcc 
ctgttctgct 
tgctgggagt 
cctgctcagt 
tggatggcac 
gactggcgtg 
ggcgactgtg 
ggttttgcac 
cagagctgct 
tgcacttatt 
cttttccacc 
catggaaggt 
cgacggggtg 
tctctcagct 
cgctaaagcc 
agttgaccag 
cgtcagtttc 
acagaaaata 
tttatttagt 
atataccgat 
gagccgaagt 
atgcatctcg 
gaagggacaa 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
_1 3.8.0- 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2200 
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780 
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960 

1020 
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1560 

1620 
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1740 

1800 
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atagtcacaa 
taaaagtgac 
gtgatttttt 
actaattttt 
cttgtttatt 
agtgcttaaa 
aacagccgac 
atttacctgc 
gtgagcactg 
cgcagcaagt 
agatgtgcag 
cttctgcatt 
ttgtcaacca 
agtatcatga 
tgggcatggc 
catatggagg 
aggcacaagg 
teg 



aattttacca 
tetgtatget 
tttctactct 
atttacttgt 
tcaaaacttt 
cactggcacc 
aaaaacattc 
aggaaggaat 
ctttgtgcta 
gcttctgtgt 
gccaacattc 
tttaaaagtt 
caggagttca 
attgeaatga 
cagtgatgtt 
ctctccagga 



aaacattaga 
aacgccacat 
gaattggaaa 
cacattttgg 
atataatcac 
agecaaagaa 
gagttgaccc 
aaagcagatg 
gacattgggc 
tctggtagca 
tggaaatcct 
acccagagat 
ttgagtggga 
tgtagtgggg 
teaegtcatt 
agacgaagaa 



aacaaaaaat 
tagaacttgg 
tatgtatgaa 
caataaatcc 
tgttcaaaag 
tgtggttgta 
caccaagttg 
caaccaattc 
ttagcattga 
actcaacact 
atgtcactgg 
gcttctaaag 
cagctagata 
tataaaagga 
gaggtgacag 
gagaaggaca 



aaggagagee 
ttctctcacc 
tatacagaga 
ctctcatttc 
gaaatatttt 
gagacccaga 
ttgecacaga 
attcagtcca 
aactataaag 
atctgtggag 
gtttggtttg 
atgagecata 
catacattgg 
aagcgatgga 
ctctgctgga 
ttctaggcaa 
tgaaaaacrcct 



aagtcaggaa 
aagctgtaat 
agtgcttaca 
taaattctaa 
cacctaccag 
agtcttcaag 
taatttagat 
cgagcatgat 
aggaatcaga 
agtaaactga 
gaaacctgga 
gtctagaaga 
cagctacaat 
tattgeegga 
ctttgaatta 
aaagaagact 
agatacaaaa 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2823 



<210> 58 

<211> 1491 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2329339CB1 



<400> 58 

cgcctccctc 

gggatcttgt 

ctgcacagcc 

aggctctget 

tggtggtgga 

acttcagagg 

acggtgtgtc 

agctcagcct 

aaattatcca 

atgtctaccg 

atatcctcaa 

agaaggtcta 

atgagggggc 

tgcacgagcc 

tgeagegget 

tcatgtacca 

ggteggaett 

tcagcgacca 

ccctggcccc 

ttggagggga 

cacggcacaa 

tgtgtgttgt 

atgctgtgga 

gggaggtgta 

gcaagcttat 



cagctgegag 
ggctgtcagg 
ctcacaccca 
ccgagggctg 
gggcagacac 
aatgtttgct 
etacaatget 
gagcaatgta 
tttctgctgt 
gctggcagag 
aaactttgtg 
ctccctcctc 
ccttctctac 
cccaaagctc 
gcatgacaag 
ceggaacgag 
ccagtgcg.tt 
ggecaagtat 
ccgcatgtcc 
caacaatgtc 
ccgctggttc 
aggcaggtac 
gcgctacgac 
tgcccacgca 
tccctttagt 



tgcggcctcg 
atggcagagg 
cactgcgtga 
ctggctctcc 
atcgaggccc 

gggggattga 

atgtgccaaa 
caagagacac 
gatttcctca 
ctgtttgact 
gccttctctc 
agcagcaatc 
cattatagee 
cttgagacag 
ctggacccca 
agcctacagc 
gtgggcttcg 
ctaaacccct 
aaccagggca 
caaggatttc 
cagatccagt 
atetaegctg 
cctgccacca 
ggcgcgacgc 
gagggttaat 



gctggcggcg 
agcaggagtt 
acaacaccta 
gggacagegg 
atcgcatcct 
aggagatgga 
tcctacattt 
tggtggctgc 
tgtcctgggt 
tgagccgcct 
ggactgacaa 
gcctggaggt 
tggagcaggt 
tgcggtttcc 
gecctttgag 
ccagcctgca 
ggggcattca 
tactgggaga 
tegeggtget 
gagcagagtc 
ccctgcagca 
cggcgggccg 
actcctgggc 
tggaggggaa 
tttagcttgc 



gcaccaggcc 
cacccagctc 
ccgcagcgca 
aatcctcttc 
gctggctgcg 
acaggaagag 
catatacacc 
ctgccagctt 
ggacgaagag 
gactgagcaa 
gtaccgccag 
ctcctgcgag 
gcaggctgac 
gctgatggaa 
ggacacagcg 
gagcccgcaa 
ctccacgccg 
gtggaagcac 
caacaacttc 
ccgatgctgg 
ggagcacgcc 
tgactaccac 
atacgtggcc 
gatgtatacc 
actggccgtc 



acagttgtaa 
tgcaagttgc 
cagcactccc 
gatgttgtgc 
tectgegatt 
gtcctgatcc 
tccgagctgg 
cagatcccag 
aacattctcg 
ctggacacct 
cttccattgg 
accgaggtat 
cagatctcgc 
gctgaggtcc 
gccagcggcc 
acggagccgc 
tccactgtcc 
ttcactgcct 
gtatacttga 
aggtatgacc 
gacctgtccg 
aatgacctga 
ccactcaaga 
acctgcggac 



<210> 59 

<211> 986 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_£eature 

<223> Incyte ID No: 2540219CB1 



60 

120 

180 

240 

300 

360 

420 
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720 

780 
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900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1491 



<400> 59 

ggacgccaac ctcggtttga agtccagggc 
gggccagtgc actgeattae aaccattgeg 
gggcggcatg gcaggagcaa tctcttgggg 



agtggctcct gcggacagcc agcataccag 60 
aggaatgagg gcctgtgccg tgtggctggc 120 
gaeagegact cctatggacg tcgugaaaag 180 
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tcgactccaa 
ccagagttac 
gcggggcttc 
ccgcggggac 
agtgttcccc 
ttggacctcc 
tggcccttct 
atccccaatg 
agctgagaag 
tcactcctct 
agtctcaagt 
ccgctcctgc 
tccagtgcgt 



gctgatgggg 
cagaaggaag 
cccatgagtg 
cacgcagtga 
tgggcctcat 
aagtggacat 
gatgcctggg 
cccactctgc 
tcctgaccat 
gtctgaggat 
aacacgtccc 
acccgtggat 
gctcctcact 



tttatttaaa 

gtcttaaagt 

cggccatgtt 

cgagcccata 

ctctgcatgt 

caattagcaa 

atgcctcatg 

taggctggca 

gagcaccagg 

ggggaggggc 

cgtgcctcca 

cctgaggaca 

.gt.gca_c_t.ta.t_ 

tgaaag 



tcaggtctgc agcaaataaa 

<210> 60 

<211> 4023 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 2722462CB1 



caaatataaa 
gtttttcaga 
ccttgggtac 
agcgccagga 
gaagccctga 
gcgtgggcta 
agtcactgat 
tcaaagagct 
gagccagaaa 
cagtgagctc 
gtctcctctc 
gcggtagcgc 
.tagtgtctgt- 



ggtgtcctgg 
ggcatcactg 
gagctgtcgc 
ggtgaacaca 
gagctgcaga 
ggatggtgca 
tcaagccctc 
ttccaagaaa 
ccacccagag 
tgggctcagc 
agcaccgacc 
cttcctcacc 
-tgagtgat-ta- 



actgtatctc 
tgaacgcggt 
tgcaggctat 
ggatgactac 
tgtttgccct 
gacactgacg 
caaggttctg 
tgtttggtcc 
aaacgttgct 
cactccctcc 
aggtttttcc 
gcacgctgag 
aatcacatcc" 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
"960- 
986 



<400> 60 

gtccggggcg 

gctctggggg 

aggcagcaat 

atgccaaggc 

tggccacggc 

tccggcttgg 

ttgaactctg 

ccatggtcaa 

ggcagatgtc 

ggtattgcgg 

tggacatcct 

tgttctcccc 

tgtttgtcct 

acccctggtc 

acctgcttct 

ctttgctact 

ggaggcctgc 

ggatgagtgc 

tgctcagcct 

agaaaaacag 

gtattaatat 

tcaacggggt 

agcacgtgcc 

aggttgccgg 

ccgtgacagt 

atttcatcgt 

accgcaacgt 

ccctgctcat 

tcttccccaa 

ccgtgggcaa 

tcctcaacag 

tggaaggcgt 

ttgtgaccat 

gaatgaacta 

aggaaatctc 

cctcctcgga 

attccatggt 

accaggagga 

aactcagcag 

agaccgttac 

tactctaccg 

tttatataca 

tgtcttcctt 



gcgcgtatgg 
tgagagttcc 
ctacctccag 
gctggcggcc 
cctgctgctg 
catctatgtc 
catgaagtta 
gacctcggtg 
ccatgtgcgg 
tggcgtccgg 
cctgctgctg 
taacccttca 
tctgaccaca 
ctgcgtcttc 
ggctgtggtg 
gcacaagcga 
cggcatctcc 
cagggagcgc 
aaaggacttt 
agagcactgg 
ccttgtgaag 
ctggatcctc 
ctggagttac 
cctgggccct 
gttcgccttc 
ggtcctgcgc 
gctggacacc 
cttttactac 
ctgctgcaac 
caggaccgtg 
ctttgcgacc 
cacc tctcag 
ggtggtgatg 
cagccgcaag 
caaagaagag 
tgtcaccagg 
gtttctggga 
gatccaggag 
cagtgcagcc 
ctagcccagc 
cgatgtacgg 
ggaagaaaaa 
aactccagaa 



cggcactacg 
ccctccagcc 
gaaggcgaga 
tacctctttg 
ctgctgctct 
cacgccaccc 
cgctggctgg 
ctggtggtgc 
gtgacccgag 
cgcaacctgc 
ctgttcttca 
gacccctact 
gccaatttcc 
ttcatcgtgt 
ttcgacacct 
accgctatcc 
tacaggcagt 
tatcttacct 
tacgatatct 
tttgatgagc 
tccaaggcct 
gtggagacat 
ctcgtctttc 
gtggagtact 
ctgggactgc 
cccctccagc 
atgttcgagc 
tccttcgcca 
acgagtacag 
gtggaggaag 
ctgtttgagc 
acctcccact 
acgatcattg 
aaccaggact 
ctggttgccg 
c tgctggaga 
cggcgatcaa 
tggtatgagg 
cccgccgccc 
gcccgaaagc 
aactgcggcg 
agacagacaa 
gccagcttgg 



ccatccacga 
ccgcacacaa 
acaacgacaa 
cacacaatca 
ccctgtgcga 
tggagctgtt 
gcctccacac 
agtttgtcga 
cactgcgctg 
ggcagatctt 
tgatcatctt 
tcagcaccct 
cagatgtgat 
acctctccat 
tcaatgacat 
agcatgccta 
ttgaaggcct 
tcaaggcccc 
acgaagttgc 
ttcccaggac 
tccagtattt 
ttatgctgaa 
taactatcta 
tgtcttccgg 
tggcgctggc 
tgctgaggtt 
tgctgccccg 
tcgtgggcat 
tggcagatgc 
gctactatta 
tcacagttgt 
ggagccgcct 
tcgcctttat 
cggaagttga 
tcctggagct 
ccctctccca 
ggaccaagag 
agcatgccag 
agcagccccc 
cgtctcttcc 
tgtgtacaca 
gatggggct t 
cgaggggcgg 



ctcccaggcc 
ctgggagatg 
gttcttcacc 
cctcttctac 
ggcccccgcc 
tgccctgatg 
cttcatccgg 
ggccatcgtg 
cattttcctg 
ccagtccctg 
tgccatcctc 
ggagaacagc 
gatgccctcc 
cgagctgtat 
tgagaaacgc 
ccgcctgctc 
catgcgcttc 
gaatcagaac 
tgctttgaag 
ggcgctcctc 
catgtacttg 
aggtgggaac 
cggggtggag 
atggaacttg 
cctcaacatg 
gtttaagttg 
gatggccagc 
ggagttcttc 
ctaccgctgg 
tctcaataat 
caacaactgg 
ctacttcatg 
cctcgaggcc 
tggtggcatc 
c taccgggag 
gatggagaga 
cgacctgagc 
ggagcaagag 
aggcagccgc 
a tgcaataac 
tactcacgta 
ggttcataac 
gggtgcggcc 



cccagtctca 

aattaccaag 

caccccaagg 

ctgatggagc 

gtccccgcac 

gtggtagtgt 

cacaagcgga 

gtgttggtac 

gtggactgtc 

ccgcccttca 

ggtttctact 

atcgtcagtc 

tactcccgga 

ttcatcatga 

aagttcaagt 

atcagccaga 

tacaagcccc 

aacacacccc 

tggaaggcca 

atcttcaaag 

gtggtggcag 

ttcttctcca 

ctgttcctga 

tttgacttct 

gagcccttct 

aaggagcgct 

ctgggcctca 

tgcgggatcg 

cgcaaccaca 

tttgacaaca 

tacatcatca 

accttttaca 

ttcgtcttcc 

accctcgaga 

gcacgggggg 

taccagcaac 

ctgaagatgt 

cagcagcgac 

cagcgctccc 

acaatagtat 

tatgcacata 

caccttgccc 

accaggtccg 
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agctcttcct 
ccagtcgact 
tgtcactgct 
ccctgcggcc 
gaaagaactc 
tcttcgggga 
ggccagagct 
ttggctctgc 
tccacagtgg 
ggccgatgtc 
ttgcgattta 
gataccaggt 
ccctgtggga 
tgctcacacc 
ctcagttaac 
tctgagcgtg 
tcagtgcaga 
ttctgcagca 
gggcattcct 
tggacagagc 
ccgaccccag 
aagctgggtc 
gagttggaca 
tattgagact 

ggg 



actgtggaag 
gcggggcttg 
actgcttcag 
catgccacag 
ccggtctcca 
ccagtgccca 
cagttagcca 
aagcctccca 
cctccttggg 
tgcaacccag 
cccgttcaag 
ccttcacacg 
cgcagcagcc 
agagatttcc 
gtcgccaccc 
gtggaggtag 
ggtcgctttg 
ggagccgcag 
ggccagcccc 
ctccctctgg 
gggagcagta 
ccatagccag 
agtagcaggt 
cactgtgatt 



gctccagaag 
cccctcatgt 
gctcacaccc 
gtttctgtgt 
gggtggtatt 
ggatctaatg 
gtgccgggcg 
cccagccttc 
gacccagaac 
accacttcgt 
caaaacaacg 
tgtgcactag 
ccgtggcccc 
atagcaggag 
ctcctcccac 
gagggttgcc 
gttccgcttc 
tggcaggatg 
ggccctctgg 
ccttgtcctc 
cctggtcccc 
cacggcatgg 
gtttgttttt 
cccccgggag 



gcccttcaca 
gggccggcct 
ccccgacctg 
tttgctttag 
tcagtgtctg 
gaagcggaat 
gccacagatt 
tctggcttaa 
ccggaggaag 
ggaatgggct 
ttttggttaa 
gaacaggagc 
ggcccagttc 
cggttggtgc 
catggtaccc 
agctgcagtg 
cctgggccac 
gagggtgcga 
tgcctgcttc 
ttcccagcca 
caccccctcc 
ttctcccctt 
aaagcacagc 
tcagactggc 



aggagacccc 
ccatcggcca 
atggcgcgcc 
ggacagaacc 
tgataatgtc 
tggggcaact 
acactgacca 
cccttgttgg 
gggcatgagg 
cttgaccaaa 
ctaaggattg 
gaacagcaca 
ccagccaccc 
agaagtaggt 
tgtaggagcc 
accctgccac 
agaacggaac 
ag ggcaagga 
ctgtgacttc 
cagaacgggc 
tcccaaccac 
ccccccttcc 
cctttgggaa 
tttgtctctt 



tcacctggat 
cgtccaaagc 
cgccccctct 
acttaggaaa 
acgcaacacc 
gggcccgtgt 
atctcctccc 
cgaaaactct 
caggaagtgg 
tccctttttt 
tgctaaagcc 
gagagacgct 
tccctggctc 
tcagatgaac 
ctgtatgaca 
agaggcaggg 
acagcatagg 
gtgcactgct 
agaaggcagg 
agggtggcac 
ctccaaggcc 
caggtcaggg 
agcaacacat 
tctctctgga 



<210> 61 
<211> 2345 
<212> DMA 

<213> Homo sapiens 
<220> 

<221> raise feature 

<223> Incyte ID No: 2739264CB1 



<400> 61 

gaaaggaggc 

cgcgctgcgc 

ttttggggct 

gtttggagga 

aactaacaca 

tactggtttt 

actgggattt 

cagccagcct 

ttctgctcca 

geaggecttt 

cacacaagaa 

taaagatgaa 

ccaacaacaa 

tgtaaatgta 

tgttgagcgt 

ttttgaacaa 

tagaacagaa 

tgatcctatt 

tgtaccaatg 

taagcagcat 

tcaaactaca 

tagaacttta 

ggctgatgaa 

tactcagttc 

tggagcagtc 

gcagcatctg 

tctagaagat 

tgtctttagt 

catcagacct 

cLtggattgg 

caccccacag 

ttaaaagaag 



ggaactcggt 
gtegcaggag 
ccctcgggca 
tttgggacaa 
ggcactactg 
ggcacaacaa 
ggaggattta 
acacaagctc 
acgctgttgg 

tggggaacag 

aatccctttt 
gatgggctag 
cagttggtag 
gagggcacta 
tegecaaatg 
gecaatataa 
ctttctcctg 
atctgggaac 
gtgggtttta 
caaaccagat 
tetgtageca 
caggtcctaa 
gagcagttgc 

a ^gggccgac 

agatctgaag 
aaacaacaac 
ataaagcegg 
tgacagttca 
tccttaagaa 
ttttttgaga 
ctcttcaaag 
tgcataatta 



gatctzgactg 
taacctactt 
cctccggtac 
catctacaac 
gactctttgg 
egggaactag 
atacacagca 
ctacccagtc 
gagatgagag 
gaaaagggta 
gecgatttaa 
tggttttagt 
aatcattgea 
aaacattgee 
gtacttcaag 
aaacacaatt 
cacagatcaa 
aggecaaggt 
aggaacttct 
tagatatcat 
aaattgeaca 
tcaaacagga 
gagttcagct 
taaatgaatt 
aaaggcatta 
aggaaggece 
ccgaacatgg 
caaacttgtg 
tgaaaccgac 
agtttaccga 
aaaacctttg 
ccaaaactgc 



gcggtttccc 
ggtctcctgc 
cgctgcagcc 
tgcaggttct 
tggtactcag 
tactggttta 
gcagcagcaa 
caaccagctg 
agatgetatt 
tttcaacaat 
ggcagtaggt 
tttcaacaaa 
taaagttttg 
agatgatcag 
aagagttcca 
gcagcaactt 
acagctttta 
agataaccct 
ccgaagactg 
atccgaagat 
atacaagagg 
aattcaaagg 
ggatacgatt 
gatgtctcaa 
catagatgea 
tagecacttg 
accgaatgaa 
taaaggtxtg 
cacatggagg 
caaattactg 
aaagacttac 
atgeactact 



ggccggactg 
tttcgegaca 
accgcggccc 
gcattcagct 
aacaaaggtt 
ggtactggtt. 
actacattag 
ataaatactg 
ttggcaaaat 
aatattcege 
tatagttgea 
aaagaaacag 
ggaggaaacc 
acagaagttg 
gctacaacgc 
ggtgtaaccc 
cagaatcctc 
gattctgaaa 
aaggttcaag 
attagtgagc 
aaactcatgg 
aagagtggtt 
cagggegaac 
ateaggatge 
gatctgttac 
actagcatca 
accacccaca 
tgaaaegcat 
gaaaaagaaa 
tccatcaaat 
atctaaaagc 
gtacattttt 



agaaggggag 
tggccttcaa 
ccgcgggtgg 
tttctgcccc 
ttggatttgg 
tgggaactgg 
gtggtctctt 
cgagtgctcc 
ggaatcaact 
cagtggaact 
tgcccagtaa 
agattcgaag 
agacccttac 
ttatttatgt 
tatatgecca 
tttctatgac 
ctgctggtgt 
agttaattcc 
atcagatgac 
tacaaaagaa 
atctttccca 
atgecattea 
taaatgeace 
agaatcactt 
gagaaaccaa 
ttaaagacga 
tcagaggtgg 
cttcttactg 
acaattcttt 
ccgaaacagt 
tgtatttact 
acaacagcat 



2640 
2700 
2760 
2820 
2880 
2940 
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3060 
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3180 
3240 
3300 
3360 
3420 
3480 
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3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4023 



60 

120 

180 

240 

300 

3 60 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 



46/60 



BNSDOCiD: <WO 0078953A2 I > 



WO 00/78953 



PCT/US00/16668 



tttcttaaac 
ataccaagta 
gctgttggag 
ctgtgggagg 
tatgaaaatg 
attaggatat 
tatttctaaa 
aaaaa 



ataatctgtg 
aatacgaaat 
atttgtactt 
agtaaagaaa 
tatttttata 
aacaaaacta 
aagatgaaaa 



tttaatgatt 
aatccacttt 
gagtataaat 
atccaaaagc 
ttattttact 
cttaaccatt 
cttataaata 



attgtccatt 
gcacataaca 
gtacaccagt 
acttaatgtt 
tatttggaat 
tttgcaacca 
aattcttaat 



gagcctgtac 
gaacaaacta 
ttttgtattt 
ttgtttttgt 
ttacagagca 
ttttgttttt 
ttgtaattac 



tctgccttcc 
taattacttg 
gtgaactcat 
tctataaaga 
cacctaagca 
taagcctttt 
ttttaaaaaa 



<210> 62 
<211> 2085 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2758310CB1 



<400> 62 

cggagatgtg 

gcggccgggc 

cacgatcgga 

gatacctgtg 

tggccgtccc 

agccccacac 

ctcggagccc 

gatggcatgg 

tggtggtgca 

tctgcgtgtc 

ctctacgtga 

acccggaccc 

gagttgttcc 

cagccaattg 

agcgtggctg 

accctggagc 

gtcctccagg 

aacgcagagc 

cactgggaca 

aacgatgtgg 

atcaaggcct 

ggcaacgttc 

gggctctacc 

atcatcctgc 

tacctcctgt 

tcgcacctac 

gaggccgtca 

atcaacaccg 

atggctaaaa 

tacggcacca 

gcgctcaggc 

cacctgagcc 

gccgcccagg 

gtgtagacac 

aggaggccgt 



gcgaccgttt 
ctgtctgtgg 
catgcatgtg 
tgcagctctg 
ctgctgtgtt 
ccctgtggag 
gctatgtcat 
ggcgctttgg 
ccaatgttgt 
accagcgggc 
agctgggcca 
tgcgcgtgct 
ttgaggactt 
ctgccgccag 
tgaaggtgca 
tcctgctgcg 
acctgaaggg 
gctgtgcgcg 
agtccagcaa 
aggccatcag 
ttgctgagca 
tggtgcggaa 
agttcctgga 
gggacgacgc 
tcgccgagat 
tgagccgcga 
tggcggtgct 
tgcgcgctat 
gggctgtccg 
gcctcctgcg 
tggagacctt 
tcgtgccccc 
gccggcgggg 
cccgagcccc 
gtaatgacca 



ctggcatcat 
gagctgcatc 
gttctgcggc 
tcatttccac 
cttcaggaga 
gaaggtgctc 
ggcagaggca 
caggtctctg 
ccttcgaggg 
ggctgatgcc 
ggggctgtgc 
agaggacagg 
ccaggccctc 
cctggcacag 
gtacatcgac 
gctcgttgag 
gaccctggcc 
ggagctggcg 
gcgcgtgctc 
gagcc agggg 
gatattttac 
aggcccggac 
ggagaaggac 
cgccatgagg 
gctcatgcag 
agaggcggcc 
cagggagctg 
caacgtggcc 
gggctggagc 
ccacgccaag 
ggccatgcgg 
agcggaggag 
cccttttcac 
gtgggcactc 
cacactcctc 



tctgagactc 
ctcctcatct 
caaagcacgc 
tctgctctgc 
aacgtcaggg 
tccaccgcgg 
cgggagaaga 
aaggtcggcc 
tggaagagcc 
ctggcggcag 
tccttcaacc 
gccctcaagc 
ccccacgagc 
gtgcacagag 
ctgcgggacc 
gtcatgcacc 
caggagctgg 
cacttcccct 
actgccgact 
ctggcagtgc 
accggcttca 
gggaaagcgg 
cgcgcagccc 
gcgcacgcag 
cgccccgtgc 
tacatggtgg 
ccgcggccca 
ctcggcgccc 
cgcctggcgg 
gtcgtctggg 
ctgaccgccc 
ctctaccagt 
cttgggctga 
gcactggggg 
tcaagcaaaa 



ggcagttgct 
gcaggcgctg 
cctttggttg 
tgcacagaag 
gccttcctcc 
tagtgggggc 
ggaggatgcg 
tgcagatctc 
caggctactt 
gggccatcag 
acctgcttcc 

ggggcttcca 

tcttccagga 
ccaagctgca 
gctttgatgg 
ccagctttgg 
acttcgagaa 
acgtcgtggt 
tctgcgccgg 
atgacatagc 
tccactcgga 
agctggtgct 
tctgccagct 
ccgcactggg 
gcctggggca 
acatggcccg 
tgctgctggt 
ccgtggaccg 
gcgccacgta 
agatgctcaa 
tcctggctcg 
acctggagac 
cggaggtggc 
gctgtgacag 
aaaaa 



tctcactgct 
gaaaaccaga 
tgaacttcat 
gcagaagccc 
aaggttctcc 
gcccctgctc 
gctcgtggtg 
cctggactac 
ggaggtgatg 
caacgggggc 
ccccgagtat 
ggaggtggat 
gtttgactac 
cgatggcacc 
ggacatccac 
cttcagctgg 
tgagggccgc 
gccccgcgtg 
ctgcaaggtc 
agaaaagctc 
cccacatcct 
gctggaccac 
gtggcgggcc 
ggtgcaagac 
gctgtggggc 
cgagcgcttc 
gctgcgcaac 
ctacttcctt 
tcggggtgtc 
gtttgaagtg 
tgctctggtc 
ctagggcgca 
ggggctagag 
cagctgggcc 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2345 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2085 



<210> 63 

<211> 3014 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2762348CB1 

<400> 63 

ggaggcaaag gccaggccga ggaccagggt ggcccgggtg gcagcgggga ggcgctgcat 60 

gctggaggct gtgctgagcg cccggcgcag gcgagccggt cctgcggagt tgcgccgagt 120 

gcccgctgca gaccgaggcc gggccaagat ggcgtctgtg tttcgaagcg aggagacgtg 180 



47/60 



BNSOOCID: <WO 0078953A2 I > 



WO 00/78953 



PCT/US00/16668 



tttgtcacaa 
gctcggattg 
tgtgaacgaa 
gacgcaaaat 
ggaaatgatt 
ccagaaccag 
gaagaaaacc 
ggacacttct 
gttcatagcc 
gcgagtctgc 
tcctgtgacg 
gctcaggcag 
agagcctgcg 
aatcaccgtc 
caactggcac 
catgtgcaac 
agatgccaca 
ggcccccatc 
taaattcaca 
gataaaccca 
agactgtggt 
acgcttgctc 
tctgatccta 
ctccaagtcc 
cacatggaat 
aggagtgtat 
aaacaaactc 
ccagatggtt 
caacatcatt 
ggttttcatg 
cagcatcctc 
acccctctac 
tgtgccgtgg 
gctgcaggca 
cccttctagc 
tggagaagag 
ctgcctgggc 
tcatgcacaa 
aggctgggga 
agccatcctt 
ggttgagttc 
caaacacatc 
tcaccatgcc 
aatgggaagg 
cctgtggatt 
t.tcatg.tg.g.g. 
aacagatgtc 
ttcaaaaaaa 



ctgtttctcc 
gttcagttca 
gtcagaaggt 
gagattgtag 
accctggaga 
caggccttga 
caagacttct 
ggcctcctgg 
gggtgtgatc 
cgaggaaacg 
aaagaagaaa 
aaaatcaaga 
gtggagcgca 
ataacacaaa 
tcctggctca 
atcgacgtca 

Cy tatCaaya 

atgaccacag 
gctggcttcc 
gccccctaca 
catggaaccg 
tcccagaaga 
cttatgggca 
ttgaacatct 
actcatgtaa 
tttggaaatc 
acatttctga 
tttggtgtca 
ctgcaattta 
atcattttca 
atccacttca 
aaacatcagc 
atgcttctga 
tccaggatcc 
cgttctggcc 
ttcaactttg 
tgcatttcaa 
ctgtctgaag 
ggaatcgtcg 
ctgatcatgg 
cagaacaagt 
ctggatggca 
aatgaaggaa 
ttgttcttgg 
tgatacgact 
gtattaattc 
ttcttgcaat 
aaaa 



aggtggaagc 
aagatttaaa 
gtgaatcact 
ttcagttgct 
ctgttctaga 
aacaaagctt 
ttgagacgga 
agttgaaagc 
caacagggaa 
tgtacttgaa 
ttcagaagca 
agatctgtga 
gagagatgtt 
cagagtctca 
tcaaggtgca 
cccagcagtg 

yyyCaCtyya 

tgcaatctaa 
agaatattgt 
ccatcatcac 
tgatgctcct 
cagacaatga 
tcttctccat 
ttggctcttc 
tggaggaaag 
catacccgtt 
actcgtataa 
tcctcagcct 
tccctgagat 
aatggtgctg 
tcaacatgtt 
aagaagtcca 
ttaagccgtt 
aagaagatgc 
agaggacttc 
gagacgtctt 
acacagcctc 
tgctctggac 
gggtttttat 
agggcctctc 
tctatgtcgg 
cagccgagga 
gttcagcctt 
ctcacctgaa 
taaccacgtc 
tcaaataata 
attttaaata 



tgcatattgc 
tatgaatgtg 
ggagagaatc 
cgagaaaagc 
aaaactggaa 
cctagaactg 
aaccaattta 
agtgcctgca 
gaggatggct 
gttcagtgag 
catattcatc 
tgggtttcga 
ggagagcgtc 
ccgccagcgc 
gaagatgaaa 
tgtcatcgcc 
gcaaggcatg 
aacagcccct 
tgatgcctat 
tttccccttc 
ggctgcactt 
gatttggaac 
ctacacgggt 
ttggagtgtc 
tctatatctg 
tgggattgat 
aatgaagatg 
tttcaatcac 
gatttttatc 
ctttgacgtc 
tctgtttaac 
aagtttcttt 
tattcttaga 
cactgagaac 
tgcagatacc 
tgtccaccaa 
ctacctgcgg 
tatggtgatg 
tatttttgcc 
tgctttcctg 
ggatggttac. 
gtaggctgag 
gtctttgata 
gcatgaaact 
agaggaagga 
aagtaattga 
ttgtatttga 



tgtgtggctg 
aacagctttc 
ctccgttttc 
ccaccgaccc 
ggagagttac 
acagaactga 
gctgatgatt 
tatatgaccg 
tcctttgagc 
atggacgccc 
atattttacc 
gccactgtct 
aatgtgaggc 
ctgctgcagg 
gctgtctacc 
gagatctggt 
gaactaagtg 
cccacattta 
ggtgtcggca 
ctgttcgctg 
tggatgattc 
accttcttcc 
ttgatctaca 
caacccatgt 
cagctggacc 
ccgatttgga 
tcggtgatcc 
atatacttca 
ctgtgcctgt 
cacgtatctc 
tacagtgact 
gtggttatgg 
gccagccatc 
attgaaggtg 
cacggggctc 
gccatccaca 
ctctgggccc 
aacagcggcc 
gtatttgctg 
cacgccctgc 
aagttttctc 
ggctgcacct 
tcagcccctg 
gtgtattatt 
ctttggcaag 
caaatgaggg 
gaaaataaac 



agctcggaga 
aaaggaaatt 
tggaagacga 
cgctcccacg 
aggaagccaa 
aatacctcct 
tctttactga 
gaaagttggg 
ggttactgtg 
ctctggagga 
aaggagagca 
acccttgccc 
tggaagattt 
aagccgctgc 
acatcctgaa 
tcccggtggc 
gctcctccat 
acaggaccaa 
gctaccggga 
tgatgtttgg 
tgaatgagag 
acgggcgcta 
atgactgctt 
tcagaaacgg 
cagccatacc 
acttggcttc 
tgggaattgt 
gaagaactct 
ttggatacct 
agcacgcccc 
cttccaacgc 
ccttgatttc 
ggaaatccca 
atagctccag 
tggacgacca 
ccatcgagta 
tcagcctggc 
ttcagacgcg 
tcctgacagt 
gactgcactg 
cattctcctt 
cccacggtgg 
caaggcgctc 
tggacgtcag 
tgatattgtc 
gagaatgcta 
acctgagtca 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880. 

2940 

3000 

3014 



<210> 64 
<211> 1726 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyce ID No: 3715961CB1 



<400> 64 

tgaaaatgcc 

cacatagagc 

tttctaatca 

gagtcctgaa 

accagacagt 

caagaagccc 

gactaggcca 

agccaaagct 

acatatgaac 



tgctgcgtac 
ctgcctggag 
acatgataaa 
agacaaggag 
gtgtgggaag 
acgtgttaga 
tgtcggaang 
acctttttaa 
acctccgaga 



caaggtatgt 
aagctcatgg 
aaggtcagag 
atgttagtaa 
ggcagagcct 
agcaccggcc 
tccccctatt 
cgatcttttt 
ggccgtgctt 



actagggcat 
tctrgatggaa 
agcagtttcc 
agcagagaag 
gaaaagagca 
ccatgagaca 
ttatacatac 
tacccagtgt 
ggcaaaagtg 



ctggggcaag 
agataagcaa 
gaaaaacatg 
ggagaattca 
tgccccattt 
aagacacagc 
acaagcacat 
attctggagg 
aagatgagca 



taaaaacaaa 
gaagagttaa 
tttttgagct 
ttctagaaag 
ggagaagcat 
tagagagatt 
agacacatat 
tcgaatggtc 
ctatccccct 



60 

120 

180 

240 

300 

360 

420 

480 

540 
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tggaaatcag 
actgacctag 
ttgcgatttc 
gggctttggg 
ttaaccactt 
atcatgactc 
ccagccgatg 
cttttgtata 
agtctatata 
tggcttactt 
gatgcaaccc 
accccctatt 
gggcagcact 
_aagtgatctg- 
gaagaatcga 
ttctatgctg 
tgaagtagac 
ctgttatgtc 
gtacatactg 
cctagtgttg 



tcattggagg 

tgaaggttca 

gtggtgtaca 

caggctgggt 

atgatacagt 

acggtttatc 

tcatcaaaag 

aatcatcgac 

aaggcttttt 

atgaaaaaat 

ttaaagatac 

atttctacct 

ttattttttt 

gteggatctc- 

agcctgacca 

aagagcctgc 

gataggaatg 

agtatttatc 

taaattaaag 

aaggaaggtg 



gatgatggct 

gatgcaaatg 

tcatgcattt 

acccaatata 

gaaacactac 

aagtttatgt 

cagaataatg 

tgactgcttg 

accatcttgg 

cagagagatg 

agtgttcagt 

ctttaggaag 

ctggaaaccc 

acaaggccatr 

ctttcacctt 

ttagaggagg 

tggaagaaca 

aatgaagttt 

ggaggtgaat 

tactttttct 



ggtgttattg 
gaaggaaaaa 
gcaaaaatct 
caaagagcag 
ttggtattga 
tctggactgg 
aatcaaccac 
attcaggctg 
ctgagaatga 
agtggagtca 
attattgaaa 
acacctattc 
aagttctctt 



gccagttttt 
ggaaactgga 
tagctgaagg 
cactggtgaa 
atacaccacc 
tagcttctat 
gagataaaca 
ttcaaggtga 
ccccttggtc 
gtccatttta 
tatgggcatc 
cacagagact 
tgactcctct 



agccaatcca 
aggaaaacca 
aggaatacga 
tatgggagat 
tgaggacaat 
tctgggaaca 
aggaagggga 
aggattcatg 
aatggtgttc 
aacccctaaa 
tgcaacacat 
gatttatagg 
tttt gtccaa. 



ccaat ga ga c 
gggcaagaag 
agtaccagga 
catacatagt 
gataattcac 
ggaaattaat 
tgtcagaaag 



cccgcacagc 
gtttggcctt 
gggagccagc 
gcttaagaaa 
ttttctgtca 
gaataaacat 
ataaaa 



attttctaaa 
tgagttgcta 
atttcagatc 
tacatttaac 
ttgttaaagc 
tttgagtttc 



<210> 65 

<211> 899 

<212> DNA 

<213> Homo sapiens 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

13.2.0_ 

1380 

1440 

1500 

1560 

1620 

1680 

1726 



<220> 

<221> mi sc_ feature 
<223> Incyte ID No: 



5108194CB1 



<400> 65 

gcggcggcga 

tcagcgcccg 

ctggtatggg 

tcctgtcctc 

ctggggccct 

tcatcttcca 

acacctacct 

gcgtggtgcc 

gcagctacta 

cactggctgg 

tggccgtaac 

gagaagaggg 

cctacgctgg 

gcggccgcaa 

ttacaacgtc 



cgcgtccggg 
cgcctgtcgg 
taatggtgtg 
gtccgtctca 
ggctggtgcc 
agtgtcttca 
caacgaggga 
ctacgccgcc 
tggcttccgt 
aacgacagcc 
cccgaaggaa 
gctgaagact 
cctgagcttc 
gcttattccc 
gtgactggga 



ccggtgaggg 
gctgaactga 
aaggaaggcc 
tcaaagcgtg 
cttgccaaaa 
aaaagatttt 
tttctcagct 
atccagttca 
ggagaagccc 
gcttcactga 
atgtacagca 
ctctaccatg 
ttcacctatg 
tt.tagt.gagg 
aaaccctggc 



ggcgcggggg 
ggaccgagtc 
cggtgcgatt 
accacaggca 
cagcggtagc 
ctgccaagga 
tgtggcgcgg 
gcgcacacga 
tgcccccttg 
cctaccccct 
acatctttca 
gatttatgcc 
agacgctcaa 
gttaatttta 
gttacccaac 



gcgccggggg 
tcctgccatt 
gcatgaggat 
agtgctcagc 
tcccctggac 
ggccttccgg 
gaactcggcc 
ggagtacaag 
gcctcgcctc 
ggacctggtc 
tgtcttcatc 
caccgtgctg 
gagcttgcac 
gcttggcact 
ttaatcgcct 



ggcccaagcg 
ccgagcaggc 
gctgaggctg 
tccctgctgt 
cgaaccaaaa 
gtcctctact 
accatggtgc 
cgcatcctgg 
ttcgccggcg 
agagcgcgga 
cgcatctcga 
ggggtcattc 
agagagtaca 
ggccgtcgtt 
tgcagcaca 



<210> 66 

<211> 643 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5503122CB1 



<400> 66 

ctttaagctg 

ggattatcac 

tggttcagcg 

ccgggaagca 

gtccaaacct 

ggatgtgcct 

agagaaatat 

cttcattcca 

acgtt ttcac 

tgggacattc 

tacactgtat 



tagctgtggt 
acctccatgt 
gattcctaca 
attcgacagg 
gtagcatttg 
gttccaagca 
aacaatgatt 
agtccactca 
ggagggaaat 
cgagcaactc 
tccttttaaa 



ttctgcagca 
atgacaattt 
caagcaggcc 
agagagaaca 
ccgtgaagac 
cage tatctc 
ggtggatagg 
gattggagaa 
caagtggaaa 
ccacatcaac 
aatatttgaa 



attttgtttt 
gtacctgeat 
gtctgactcc 
gcaagcagct 
aaatgtgagc 
etttgatget 
aaggctggtg 
cataeggate 
ttcttcttca 
aggtgagggt 
cacacatgca 



tgccttgaaa 
ggaattgaag 
gatgtctctt 
atccagc ttg 
tactgeggeg 
aaagac tttc 
aaagagggct 
cagcaagaac 
agtcttggag 
tgtagt taaa 
age 



gaggtgetet 
acteggagge 
tggaagagga 
agagagcaaa 
ccctggacga 
tacacattaa 
gtgaaattgg 
aaaaaagagg 
aaatggtatc 
ccctttttca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

899 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

643 
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<210> 67 
<211> 2574 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 5517972CB1 



<400> 67 

gcgcctccca 

gaaggggttg 

caggacggaa 

agcctgtgga 

ttaagtggaa 

ttacatcctt 

ttggttaaga 

aatgtcgaag 

tccaatgacc 

cgagtaaaac 

tcgaatatca 

ggcaaatctt 

gatgttctga 

gtacaagatg 

gctcttcggc 

attcaagagt 

ggtgtgtctg 

tccatcttgt. 

cttttgctcc 

cctcgatatt 

atgttccacg 

gaggcctata 

gtggcattaa 

gataagccac 

acaaaagctg 

aaggagatca 

ttcaaaaact 

ctgggactgg 

aacagagctg 

gtggaactct 

agagtgtcat 

ccaagtatta 

gccttcttcg 

ctggccatag 

tttgtgttta 

ctgtcatggc 

gaattttfcgg 

tatgcaacac 

ggctxgtgga 

tacctgaaat 

tatcctcaca 

ttctgttccc 

tagaaatcac 



cgccggccgc 
tgccgcgcga 
gacgcagtgc 
ggaactgggt 
actgctgctt 
aattgttaaa 
ccgagctcta 
ttttcatccc 
tgaaggcatt 
tgaagagtgg 
atgggatcat 
cgttattaga 
taaatggagc 
atgttgtgat 
ttgcaacaac 
taggtctgga 
gaggagaaag 
tcttggatga 
tgaaaaggat 
ccatcttcaa 
ggcctgctca 
ataaccctgc 
acagagaaga 
tcatagaaaa 
aattacatca 
gctacaccac 
tgctgggtaa 
ttataggtgc 
gggttctctt 
ttgtggtaga 
cttatttcct 
tatttacctg 
ttatgatgtt 
cagcaggtca 
tgatgatttt 
ttcagtactt 
gacaaaactt 
gtactggcga 
agaatcacgt 
tgttatttct 
taaaaaagaa 
ttgccatcac 
aacaaactga 



cgccgacgtg 
cggtctgcgt 
tcgccactcc 
aggatttagg 
tagagtttgt 
gcgctgcctc 
ttaagctgaa 
agtgtcacaa 
tactgaagga 
ctttctacct 
gaaacctggt 
tgtcttagct 
accgcgacct 
gggcactctg 
tatgacgaat 
taaagtggca 
aaaaaggact 
gcctacaact 
gtctaagcag 
gttgtttgat 
ggaggccttg 
agacttcttc 
agactttaaa 
attagcggag 
actttccggg 
ctcctcctgt 
tccccaggcc 
catttacttt 
cttcctgacg 
gaagaagctc 
tggaaaactg 
tatagtgtac 
tacccttatg 
gagtgtggtt 
ttcaggtctg 
cagcattcca 
ctgcccagga 
agaafeatttg 
ggccttggct 
taaaaaatat 
gcactttgat 
actgttgcac 
attaaacatg 



atcgctcggg 
gctgtgccca 
cactgagatt 
aacgcaccgt 
ttggaaggtc 
cgagcgcacg 
aagataaaaa 
ggaaacacca 
gctgtgttaa 
tgtcgaaaac 
ctcaacgcca 
gcaaggaaag 
gccaatttca 
acggtgagag 
catgaaaaaa 
gactccaagg 
agtataggaa 
ggcttagact 
ggacgaacaa 
agcctcacct 
ggatactttg 
ttggacatca 
gccacagaga 
atttatgtca 
ggtgagaaga 
catcaactca 
tctatagctc 
gggctaaaaa 
accaaccagt 
ttcatacatg 
ttatctgatt 
ttcatgttag 
atggtggctt 
tctgtagcaa 
ttggccaatc 
cgatatggat 
ctcaatgcaa 
gtaaagcagg 
tgtatgattg 
tcttaaattt 
tgaagtattc 
agcagcaatt 
aaagaaccca 



cgcgccgggc 
ctcaaaaggt 
gagagacgcg 
gcacatgctt 
cgggtgactc 
catcctgaga 
ctctccagat 
atggctteec 
gttttcataa 
cagttgagaa 
tcctgggacc 
atccaagtgg 
aatgtaattc 
aaaacttaca 
acgaacggat 
ttggaactca 
tggagcttat 
caagcacagc 
tcatcttctc 
tattggcctc 
aatcagctgg 
ttaatggaga 
tcatagagcc 
actcctcctt 
agaagaagat 
gatgggtttc 
agatcattgt 
atgattctac 
gtttcagcag 
aatacatcag 
tattacccat 
gattgaagcc 
attcagccag 
cacttctcat 
tcacaaccat 
ttacggcttt 
caggaaacaa 
gcatcgatct 
ttattttcct 
ccccttaatt 
aatcaagttt 
gttttaaaga 
agaaaaaaaa 



gtggttgggg 
tccgggcgcg 
gcaaggaggc 

ggtggtcttg 

atcccaacat 
tcctgagcct 
gtcttccagt 
cgcgacagcti 
catctgctat 
agaaatatta 
cacaggtgga 
attatctgga 
aggttacgtg 
gttctcagca 
taacagggtc 
gtttatccgt 
cactgatcct 
aaatgctgtc 
cattcatcag 
aggaagactt 
ttatcactgt 
ttccactgct 
ttccaagcag 
ctacaaagag 
cacagtcttc 
caagcgttca 
cacagtcgta 
tggaatccag 
tgtttcagcc 
cggatactac 
gaggatgtta 
aaaggcagat 
ttccatggca 
gaccatctgt 
tgcatctcgg 
gcagcataat 
tccttgtaac 
ctcaccctgg 
cacaattgcc 
cagtatgatt 
ttttgttgtt 
gatacatttt 
aaaa 



60 

120 

180 

240 

300 

360 

420 

4 SO 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 
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2340 

2400 

2460 

2520 

2574 



<210> 68 

<211> 1571 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 5593114CB1 



<400> 68 

ggcgcggcga cagctagggt tcacggccac 

tttttggtta cgggtccctg atctggaagg 

gacacatcac caactacagc aggcgcttct 

ccggcaagcc tggaagagtt gcgactcttg 



tggggcagag gagccgcgag aagatgtggg 60 
tggatttccc ccatcaggac aagctggccg 120 
ggcagggcag cacggaccac cgcggggtcc 180 
ctgaagat.cc tgcgggatgt gtatggggtg 240 



50/60 



WO 00/78953 



PCT/US00/16668 



ttgcttacag 
aaaaaggagg 
cattcagtgt 
ctctggaaga 
aatatctttt 
atcttttcgc 
gcatataatt 
tacgtttact 
aactggaaat 
aattttaata 
gagttcttta 
atcatcctgt 
aataaatata 
-atacacaagc" 
ttaattgtga 
aaattggcat 
tttttaaata 
tacaaatcat 
aactgaaaaa 
taaaacagct 
tatacataaa 
caggctggaa 
tttgttttgt 



attgccagta 
ctacagaacc 
attgccatat 
cattgctgaa 
tgaacttgca 
tttggaaaaa 
tagtcttcag 
taaagatctt 
gtcctgaaac 
acataaagat 
gaaaaataca 
ttcacatccc 
aggttatgcc 



ggaaaggaag 
acaacagtca 
attggaacat 
caaattttta 
aattctatta 
ttagtaaagg 
agaattaact 
atttttaatg 
acatatttaa 
ttcctaactt 
gtgaaggact 
aatactattt 
ttcaatatat 



t'ta'aaataac 
atttccttgt 
ttttaaaata 
aaggcatttt 
gatcactaat 
gatgggctga 
tttggaattc 
tatacatata 
agtgtaacct 
t 



atgcttagaa 
tattcaagta 
cgaaaatttc 
actatatgga 
gatgtagtct 
tactacaaat 
tttgtctgtc 
aaattttttt 
ctggcagaag 



aagaagcaaa 

ttttttatcc 

gtgataatcc 

atgcagccgg 

ggaaccttgt 

aacgtttaga 

tcagtgcaca 

tagtgaggat 

aatattggga 

tatgttattg 

cagttcagtc 

tgaaattcta 

tccta.ta.caa- 

atacacaata 

ttaaatgaaa 

cttggaatta 

aaataactca 

gtcattcact 

taatggcaca 

actatctcaa 

tttctttgca 

ccaagaacag 



agcatacctt 

aaaagatccc 

tgattatctt 

tccaagtgga 

gccagaagaa 

agggaaacag 

atgacaatat 

attatttaaa 

tacagtgaaa 

aacacttact 

ttgtttttat 

aacaattaaa 

.t.tctg-taacc- 

atatgaacag 

tcttttgagt 

taatgtactg 

ctaaagcata 

ttgtattaat 

tataatgaaa 

tttgtgtgtg 

gcctgcgtct 

gcacctcctg 



gacttcagag 
acaacaaaac 
ggtcctgcac 
agaaatacag 
"gcagatgagc 
aacctcaatt 
gatttggaaa 
cttttatttt 
gaaaaattca 
cactagaagt 
cagagtgata 
ccaaaattcc 
-atggtrttaaa- 
tatttcagcc 
ttttagccaa 
tacctcttct 
aattacatta 
cttataccaa 
atttagtttt 
tgtgtgtgta 
ggccatccca 
gaattataat 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
-1-02-0- 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1571 



<210> 69 

<211> 1549 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__f eature 

<223> Incyte ID No: 044775CB1 



<400> 69 

cgcgcttagg 

tctgcagccg 

ccgggaccga 

gaggacatgc 

gatgtggtgg 

tgcagtcatc 

ccagrtgtcaa 

aagaagggcc 

ccagagcaat 

atcctgattc 

ttctcccgag 

tgtagaaata 

tggcttataa 

gcttttgaat 

atgtctgctt 

ataaaacaaa 

gtgaaagaag 

gccacaacta 

aaatacagat 

ctttatcgtg 

gccacctatg 

gcaaaaaaag 

acagaagaaa 

ttatagtaat 

cagagcctcc 

ccatgtcccc 



caggcggtgg 
catctcggcc 
agccgcctgc 
gggagagaga 
tacagtggga 
ttctgtgacg 
ccgagtagtg 
tcgttccttg 
atactttgct 
tacccaagta 
aaaagagtga 
tggactaact 
agactcggtt 
gtgttcgtaa 
catatgctgg 
aactactgga 
catcagattt 
tagcatatcc 
ctttttttca 
gtctgacaac 
aattggtggt 
aagaccaaaa 
aatagtttgg 
cacaccacat 
aaggaaatgc 
cagacacagt 



cgcggctgga 
agctctcctc 
cgtagcgggc 
atgagccaga 
gctattctga 
ctttatattt 
tctcccggac 
tttagaggac 
gcttattcaa 
catatgattt 
gatcgtgtca 
taaacctcgt 
acagcttgat 
agtgtatcag 
tatatcagag 
atataagact 
tgtgggaatg 
acatgaagtt 
gactctatct 
tcatctagtg 
ttacctactc 
gattacagtg 
gaacatgtaa 
tacttggcct 
ctttagaagc 
tgggttttgt 



gtgccgcggg 
gccgtccccg 
ggccagatcc 
gggacacgct 
catgtccact 
ctgaagttca 
ctcttcattg 
taggccccaa 
actgcaagga 
cagctgcaat 
tctcatgctc 
tttactgcaa 
gcaaggaacc 
acagatggac 
actgttatcc 
gcttctacaa 
atgctagctg 
gtaagaacaa 
ttgcttgttc 
agacagattc 
aatggatagc 
gaccatggga 
ctattctaag 
ttcggtaatg 
actcctctct 
tgatttatgg 



gagggctgtg 
gggcgctgtg 
gcgtcccgcc 
ggtgcatctg 
ggaagttgta 
gctgaacacc 
cctaaaggtg 
tttagtgggg 
aaagttgaat 
ggcaggcatt 
cccatccgca 
tcacagcaac 
gcggggaaag 
taaaaggatt 
attttgttat 
tggaaaatga 
ctgccacctc 
gactacgtga 
aagaagaagg 
caaacacagc 
agcacgagga 
tacagaagcc 
tggaagtttt 
tgaaaaaaaa 
caaaattgcc 
cagtcttct 



ccggttgctt 
cgtctccagt 
tcagcggccg 
tttgccggag 
aaaacacgac 
atggctggag 
atcttggaaa 
gtagcccctt 
gatgtatttg 
tactgtacat 
ggtcacttcc 
caaccccatt 
gcgaatgggt 
ttataggggc 
ttatgaaagt 
tgaagagtct 
aaaaacttgt 
agagggaaca 
ttatgggtct 
cattatgatg 
ctgctgtact 
agcatggcag 
gttgtaggaa 
aaaaaaacct 
attttctcta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1549 



<210> 70 

<211> 2237 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 



51/60 



BNSDOCID: <WO 0073953A2 I > 



WO 00/78953 



PCT/US00/16668 



<223> Incyte ID No: 116588CB1 



<400> 70 

gtcactggtg 

tcccctttgc 

acgtggaggt 

ctatttttgc 

gaataccacg 

tttctttggt 

tagctgcttt 

ttggctgatg 

ccagaataca 

tcaacttagc 

ttctaagatt 

gattgggggt 

tcagagagca 

aactgatgta 

ttatgttgct 

attttcttcc 

gtccccggtg 

cgcagtagca 

gattgctatg 

ggtggcattt 

tcagaatctt 

gtatttgttt 

cttactctat 

ggaagaaata 

ggcttaacac 

gaaaaaataa 

ttgtgaggcc 

tagtaggcta 

caggaaagac 

tatatstctt 

ggaaatagaa 

actagtaggg 

cacgaacaat 

tcaatttctt 

ataaactttc 

gataattgcc 

aaaaaaaaaa 

cttggcactg 



ccacggggcc 
gatcgattcc 
cgagggactg 
tatgataatc 
aaattcatgc 
ggctttttga 
gtttgcattg 
gaatttggaa 
tatgctgtga 
atggctagaa 
gaagctttgt 
gtaacgtgta 
gagagaatcc 
aaggacttct 
gtgttccctt 
caggcagcaa 
tttgggctcc 
gccactcttg 
tgtcttctgg 
gtagttcctg 
gggttggcca 
ttggaagtgt 
ttggtgaatc 
aaattttccc 
atcgttggtt 
tggactggaa 
tgttttagcc 
atcaacaatg 
agtgaaaaat 
agggatacag 
gggctgcgtt 
cataatgcta 
agcttgcgct 
atttctgtgt 
tgtacagttt 
tatgtacatg 
aaaaaatgag 
gccgtcg 



tcagcgcact 
agccacacct 
agctgggcga 
ctgctgccct 
tgctgtatgc 
tagaccgagt 
gacaggttgt 
gatttgtatt 
gctggtttaa 
ttggaagtac 
taggttctgc 
ttctttcact 
ttcataaaga 
ccttacccct 
ttattggact 
gtgcaattaa 
tggtggataa 
tgtcccacat 
gactctccta 
aacatcagct 
tcatttccat 
tcttcattgc 
gtgcccaggg 
atactgaatg 
tgaaaacttc 
agttatattt 
tgtgtctttt 
aaagggttag 
ggaaaacgtt 
ccttttcttt 
acacaaa'ata 
gggaatatgt 
ctactctgta 
tactgaggac 
ttcttatagt 
gataaattaa 
cggccgcaag 



tctgtcttaa 
gtggatgttg 
gttttgtggc 
tcagactcaa 
ctggtattct 
atttggaata 
ttttgccctg 
tgggattggt 
aggcaaagaa 
agtaaacatg 
tggtcacaca 
aatctgtgcc 
acaaggaaaa 
gtggcttata 
tgggaaagtt 
cagtgttgta 
aacagggaag 
gatgctggcc 
ctcattgctt 
gggaactgca 
cattgctggt 
ctgtgtttct 
tgggaaccta 
agaagttaaa 
catttttaaa. 
atatccaaat 
gtattgtgtg 
aaaattgctg 
ggagcttctg 
atcttatagc 
aacaatggca 
gaagatgttt 
gttatgtgga 
cctaatcact 
ctaataagta 
aacactgcac 
cttattccct 



gctcctgggc 
ctagttactc 
actcctttgc 
gttaaacgag 
tggcccaatg 
cgatggggca 
ggtggaatat 
ggcgagtcct 
ttaaacctgg 
aacctcatgg 
accctcggga 
ttggctcttg 
acaggtgaag 
tttatcatct 
ttCCttacag 
tatgtcatat 
aacatcatct 
tttacgatgt 
gcctgtgcat 
tatggcttca 
atgatactgg 
ttgtcacttt 
aattattctg 
atgaatgtgt 
aatttagagt 
atacctattt 
ttgctaaaga 
tggaacatcc 
ttgagataat 
aggaaaaaaa 
ttgtcatagg 
ttatgaagtc 
ttgccgagca 
tagggatgta 
aaaagtgtcc 
acggagtaaa 
ttagtgaggg 



ctccttattt 
cgcgtccgga 
tcttcagcag 
atatgcaagt 
tagttttgtg 
caatcatttt 
ttaatgcttt 
tagcagttgc 
tgtttggact 
gatggctgta 
tcacacttat 
cctacttgga 
ttattaaatt 
gtgtctgcta 
ag aaatttgg 
cagctcccat 
gggttctttg 
ggaacccttg 
tgtggccaat 
tgcagtccat 
attctcgggg 
tatctgtggt 
caagacaaag 
catgagaatg 
ttagtcatta 
caaagtgtat 
attctacttt 
aggtgaactt 
cttcattagg 
aacttttgag 
ccttcccttt 
tctttctgat 
atgacccttt 
attttatagt 
ttcaaattat 
aaaaaaaaaa 
ttaattttag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 
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1560 
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1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2237 



<210> 71 

<211> 1114 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 



875369CB1 



<400> 71 

tgggaacgga 

tggagggtag 

ttcccagggg 

cgaggagtcg 

aagacacctc 

ctggggcgga 

ggagaacaag 

cctcgtcctc 

agaggaggaa 

aagcacagca 

gggaggaaga 

agcccatgac 

ctgagacggg 

ccaaagaacg 

agatgcgagc 

ggcggcccag 



gcagccccgg 
gagagcgcgg 
tttggagccg 
cagccggtcc 
gaggaactgc 
ggcctcacct 
atccagctcc 
ctcctcttcc 
gaagaagaag 
ggtggaggct 
ggatggccca 
caaggaggag 
gcgcaccagg 
acacagagag 
tgggttgctt 
cctgggcagg 



gggccccctt 
gaaagcgccc 
gcccgcggcg 
cggggacggg 
tcggcctcca 
tctccctgca 
tcctcctctt 
tccagtgatg 
aagaggaaga 
ctgccgggcc 
gccctgacgg 
tgggatgccc 
cttattaagg 
atcaacaagc 
ccctgagggc 
Ctccagggtg 



gaggcggcga 
cagacgccac 
ccatggctca 
ggtcggaaaa 
catcccaagg 
tcacagagag 
cttccagttc 
gccggaagaa 
agctgaagaa 
cctcgctgga 
atgagcagaa 
ggcagagcat 
gagatggcga 
aagccacccg 
ccccgctggc 
ccagtgggaa 



ggccgcgaag 
tcgcggcgga 
cgtcggctcc 
gagaaagaag 
tcgcaaggcc 
aagcaagcag 
ttctagct.cc 
gcgggggaag 
gaagggcaag 
ccagtggcac 
gtcccgaatc 
catccgcaag 
ggtcctagag 
aggggactgc 
caaggcctgt 
gcctgatggg 



ggcgcggggc 
cggcggccag 
cgcaagcgct 
aagagcagga 
agcacggccc 
aaggcccgga 
tcttcttcct 
tacaaggaca 
gagaaggcgg 
cgatcagctg 
caggccatga 
gtggtggacc 
gaaatcgtaa 
ctggccttcc 
ggacgacgct 
tgctggtggc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
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720 

780 . 
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900 

960 



52/60 



WO 00/78953 



PCT/US00/16668 



ctttcccccg tggattggtc tctggcccag 
ggttggggtc accggcctgc ttggcacccc 
aaaggccccc tggatagaca aaaaaaaaaa 

<210> 72 

<211> 998 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 1325518CB1 



cccagtctct tctcaggggc agggggtgga 1020 
catctgaaag agcagcactt ctcagctatt 1080 
aaaa 1114 



-<4-00>-7-2 

gactaaaaaa 
atacattgcg 
aaatgaagct 
tccccttcat 
ttccttatgg 
ccagttaccc 
cttctcctgg 
atgttcctcc 
cagctgcagc 
cagccacacc 
cagaggcacc 
cagaggcacc 
cagccaagcc 
gaaattctct 
ctacaaaagt 
tctcactaca 
cattgataat 



gccatgtatt 
tattttctaa 
tctcctttgg 
tggtgaggat 
catacggaat 
tgggaatact 
attcccctat 
tctccctcct 
acccgctgcc 
tgtagcagct 
tgttggagct 
tgttggagtg 
tgctgcccca 
agaagagtac 
tttctttctt 
ttgatttgtt 
taaataaata 



ctttcgtttc 
aacaataaat 
gcctgcattg 
gacaatgacg 
ttaccacctc 
tacactgaca 
gtctatcaca 

aggggtttcc 

ccacctattg 
gagcctgctg 
gagcctgctg 
gagccagctg 
gaacctcacc 
catgggttca 
ttccaaagac 
tgtggtagtt 
aaatagataa 



tctctaaaag 
ttatagtgtt 
tatgtgttgc 
atggtcaccc 
ctctttatta 
cagggttacc 
tccgtggttt 
cgtttgtccc 
cagctgagcc 
caggggcccc 
cagaggcacc 
cagaggaacc 
cttctccctc 
tttctatact 
tatttcattc 
tttccttgga 
tttagaca 



aagaaaaata 
aatattcata 
ttttgcaagg 
acttcatcca 
tcgcccagtg 
ttcgtatccc 
tcccttagct 
tccttcaagg 
tgctgcagct 
tgttgcagct 
tgttgcagct 
ttcaccagct 
tcttgaacag 
gatgcagaaa 
tgttgtattc 
cttaatttat 



taatttaaaa 
gggtcaatca 
aagagacggt 
tctctgaata 
aatacagtcc 
tggattctaa 
actcagttga 
tttttttcag 
gcacctctta 
gagcctgctg 
gagcctgctg 
gagcctgcta 
gcaaatcagt 
taagtgaaat 
agagtattca 
attgaaaaaa 



<210> 73 

<211> 2348 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2060987CB1 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

998 



<400> 73 

gggccagccg 

cagtcggggg 

ctctcgccag 

catctcgacg 

cgcccccatc 

cagcctcctc 

tcgagaggag 

accagtgttc 

aggacggcca 

gctgctgggc 

gccggggatg 

gtttctagtc 

ccgcctggac 

gggtgaacat 

gcaggcggaa 

cttgttcttc 

gaatgccgtc 

ggcaggctcc 

gaagttcccc 

agcggaggat 

gaactggatc 

cccggagact 

tgtcaccaag 

cgtgcggaca 

cccactcgct 

tgctggcaat 



gctcgcccgg 
cctcccgggg 
actctcacct 
ctgaaagatg 
ctcaacttcc 
catgaagccc 
ttggatcgat 
ccagtgaagc 
gcccctgtcc 
cgaatgctgg 
gtgcgcctgg 
tgctacaggt 
tggcccatcg 
gacaagatgg 
ggcggtggct 
gtcgggaacc 
accaagcact 
ttcctcctcc 
ttgcgcatga 
ggggtcaccg 
gagatcgcct 
gtgggctcgg 
atcatgctgc 
tggtctgtga 
tcccttaaga 
gacattggcc 



gggccatggc 
aagtcatcca 
ggatcccaga 
agaccggagc 
tgcgcaccaa 
agttctatgg 
cttcttgtgg 
ggcggaaccg 
gacggagcaa 
atgagaaaac 
tgtgtggaca 
tgaaggaagc 
aacgactggc 
tggcagcagc 
ccgagatagg 
agctcattgc 
ggcaggtcca 
tgggctgcaa 
aagacaacga 
ccctcagtgt 
atggcaccag 
ggcctcagct 
cggagaagca 
ctcgcttccg 
tcctggctct 
cctacggtga 



agcagcggct 
tctgaatgtg 
ctccttcttc 
aatcttcatc 
agagttggat 
gctcactcc t 
aaacgtcctc 
gcacagccta 
cacgatgccc 
ccctccctca 
ccataattgg 
ctctggctgg 
gctcacagcc 
caccggcagc 
ggtctttcat 
tacaagccac 
ggaggtgcag 
caacggctcc 
cctccttgtc 
ctacctcacc 
ctcagggggc 
cttccagacc 
cctcatctca 
cggcacgatt 
ggagtcggca 
gcgggacgac 



actgcagccg 
ggaggcaaga 
tccagtcttc 
gacagggacc 
cccaggggtg 
ctggttcgtc 
ttcaatggtt 
gtggggcctc 
cccaaccttg 
ccctcaggac 
atcgctgtgg 
cagctggtgt 
cgggtgcatg 
gagatcctgc 
ctgggggtgc 
acagggcgca 
cccatcacca 
atttactacg 
agcgagctct 
cccaagacca 
gtgcgggtca 
t tcactgtgc 
gtctgtgccg 
tccacccagc 
gatgggcatg 
cagcaagtgt 



agggggtccc 
gattcagtac 
tgagcggacg 
ctacagtctt 
tccacggttc 
gcctgcagct 
acctgccgcc 
agcagctagg 
gcaatgcagg 
aacctgagga 
cctataccca 
tttccagccc 
gtggggcttt 
tatgggctct 
ctgtggaggc 
tcggggtgtg 
gttatgacgc 
tggatgtgca 
atcgggaccc 
gtgacagtgg 
tcgtgcagca 
accgcagccc 
acaacaacca 
ccggctccac 
gcggctgcag 
tcatccagaa 
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ggtggtgccc 
ctccgtgcgc 
ctcccggcgg 
cttggccatg 
cctgacggag 
ttcagctccc 
ccactcagcc 
ggctgaggcc 
gagtgggcca 
cactcccctc 
gccatgatgc 
ggaaccccgg 
caccttccct 
aaaaaaaa 



agtgccagcc 
tccgtggacg 
ctcggctctc 
tgggacctaa 
caagagctga 
tcatggggct 
tccagcaaca 
cggcgccgtg 
gacctccgac 
acacctccca 
cttgggatgc 
gttcagggcc 
cttttctgga 



agctcttcgt 
gctcacccac 
ggccccggcg 
ccaccgccat 
Cggaacagct 
gtctccccag 
cctccttgtc 
gtgggggcag 
ggccacccac 
agatgaagct 
cctggtcctg 
agggcctcct 
agccaaagtc 



gcgtctctca 
gacggccttc 
ctacctgctc 
ggacggcctc 
ggaacactgt 
cccctcaccc 
tggccaccgt 
ctttgtggaa 
accagccccg 
caatgaaact 
ggggactcag 
tggaataaat 
accctcccca 



tctactgggc 
acagtgctgg 
actggccagg 
ggccaggccc 
gagctggccc 
cgcatctccc 
gggagcccaa 
cgctgccagg 
tggccctcca 
tccttttgaa 
gtgcctccct 
ggttattgtt 
ataaagtcct 



agcgggtgtg 
agtgcgaggg 
ccaacggcag 
ctgcaggtgg 
cgccggctcc 
tcaccagcct 
gccccccgca 
aactggtgcg 
gcggtctcgg 
caacgcagct 
gattcctgtg 
actaggtccc 
cactgccaaa 



<210> 74 
<21I> 1133 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> raisc_feature 

<223> Incyte ID No: 2172064CB1 



<400> 74 

ctcgagctgg 

ccacccccgt 

ctgccagtgg 

agccgggctg 

gggtcttcca 

atcacgtgat 

agggacggga 

atgtggctca 

acctgtatgg 

gtataacctg 

gagtcagcgg 

gatggtggag 

gcaccagaag 

ttcaagaagc 

gagatgcctg 

tgtccaagga 

gacaactggg 

ctaataaaag 

agtgtcctcc 



gatgtgtggc 

ggggcgcctc 

gcctggagtc 
caaggctgcc 
ggtcatcttg 
ctggcactgg 
gggcaacaca 
gctgggggct 
gttccagatg 
caatctgtcc 
tttctgtctc 
gacctggaag 
acacaagaaa 
agttccagga 
agttggagat 
ggagcttatt 
actgcgctgc 
ccatgtttgc 
ttacttatcg 



aggttcctgc 
ttgcttcccg 
tatggtgatg 
tgcttcgatg 
gtggctgtac 
gaattatcag 
gatgtcccag 
cggcttgtcc 
cccagctcct 
cgcccctctg 
ttgtttactt 
cacaaatctt 
gcaaccgata 
agaagcttag 
gaactttggc 
caccattttc 
cttgactcca 
accattgctc 
caagtaataa 



ggcggctgct 
Cgctcctggg 
agcagagtga 
ccttccaccc 
ccagcgccct 
gaaaggggaa 
gggctggaag 
tggagggggc 
ttgcatgtcg 
agaagaccat 
ttttggagct 
cctcttctaa 
gcctcccagt 
cccaggaaaa 
caactttcct 
tatacatgcg 
cctctacata 
agggaggcat 
aaatcccctg 



ggcggaggag 
attccgcctt 
attcgtgtgt 
cctctccccg 
ctatatgggt 
ggaggaggag 
cctcaggctg 
agccctgggg 
ccgagaacct 
tttcctaaag 
tgtgcttctg 
atacttccta 
ggtggaaacc 
acaaagacca 
catcaccata 
acatat.gt.ag 
caatgactca 
tgctttgggg 
ggaaatcctc 



agccggcgct 
gtgctgctgg 
cacacccagc 
ctgcgttcct 
ttcactctgt 
accctgatcc 
ctctgggctt 
t tgcagtacc 
tgccttggta 
accatgtttg 
ggtttgggga 
acttcagaga 
aaagagcaat 
gttggaccca 
cttaaaatcc 
cagcataacc 
gctaaccaga 
aattattccc 
aaaaaaaaa 



<210> 75 

<211> 863 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 



2219267CB1 



<400> 75 

tcagagctag 

tcctggaggc 

gtggacgccg 

tcctcgctgg 

tcctggcctc 

cggtgatcag 

ccggccccct 

cctcccttgc 

ggctgcccca 

gcccccgcca 

cccccgagca 

ggcctggggc 

aagcccgtgt 



cttccgccgc 
cggcctggag 
ccccgtgctg 
ggcccagtat 
ccgggctgtg 
gggggccggg 
ggacaccctg 
tgtccttgcc 
gtcactgcag 
ggaccacctg 
gcactagtcc 
atggctgagt 
ctcagtgcag 



agcctggcac 
gcggcagcct 
ctgctgggca 
ctgccaggct 
tccgcactca 
ctgggcctgg 
cacggccggc 
c tgctgtgtg 
gacgccgacc 
cctctgctgc 
tgcctggtgg 
accccagacg 
gtggagccgt 



ctcaggtgcc 
tggtcttcct 
ccatggtcac 
ggactgtgct 
gcagcctctt 
tgctgggggc 
agggcttctt 
tcctgctgct 
gcctgcgccg 
cgccctccaa 
ccctgggagc 
tctggtccag 
ggggacagcg 



gaccttctac 
gctcctgacg 
aggcctggca 
gttcctctct 
cgcggccgag 
cgggttcctg 
cctgcaacaa 
gcctgagagc 
ctccccactc 
c tcctactgg 
caggatggga 
ggcagacaca 
tgaaggtgtc 



ctgccctact 
gcagattgct 
tccctgctgc 
gtcctggggc 
gtcttcccca 
ggccaggcag 
gtcgtcttcg 
cgaagccggg 
ctgcggggcc 
gccggccaca 
ccaaagtcaa 
ttcctctcag 
tccagccagg 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2348 
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ccccaggcac tgggaggccc tgggcctccc cccagccaca cccagtaggt gtggaggata 840 
aaggcttctg tggaaaaaaa aaa 863 

<210> 76 
<211> 1322 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 
<223> Incyte ID No: 



2308629CB1 



<400> 76 
_ccggggggcc- 
actccggccc 
caggcgccgt 
cccggatgca 
tctggaggat 
caacaggcgc 
cattgagtga 
gtgtggcaac 
tgcagatgta 
atgaaggggc 
tccaagccat 
ggtacaaccc 
ccacaacccc 
actcacacat 
taggtggggt 
ccacagccat 
aagagtggag 
gctgcatcct 
ggaaggtaga 
acctctgttg 
gcacctttga 
ggataagcag 
aa 



-ggegcgcggg- 
ggactacgag 
ggcagggatc 
gagtctacag 
tataagaacg 
agggcctgcc 
tgtaatccac 
attacttcat 
caactcacca 
cggggccttt 
tcacttcatg 
aagctcccac 
actggacgtt 
tacaggacat 
gaccgcctat 
cgcatggtct 
ggctggcaag 
ggtcacattc 

ggggctctcc 

ccaccacctt 
taacctctct 
ctcattcccc 



gaggccgggg 
gcgctgccgg 
ctggagcact 
cctgacccag 
gagggcctat 
cacgcccttt 
cctgggggca 
gatgcagcca 
taccaccggg 
taccgcagct 
acctatgaat 
gtcctctctg 
tgcaaaacac 
atcacaggca 
ttccgagggg 
gtgtatgagt 
tgaagtagca 
tctgtctcct 
cccaggattt 
tccttccagg 
ccatcctggg 
tggttcctaa 



cctgcaggcc 
ctggagccac 
gcgtgatgta 
ctgcccgcta 
ggaggcccat 
attttgcctg 
atagccatat 
tgaaccctgc 
tgacagactg 
acaccaccca 
tcctgcagga 
gagcttgcgc 
tgctcaacac 
tggctagtgc 
tgcaggccag 
tcttcaaata 
ctgaacgaag 
ggaatgctcc 
tggtgttttg 
ccctaagcac 
cctgatgacc 
taaaaagcct 



cccggtacga 
tgtcaccacg 
ccccatcgac 
tcgcaatgtg 
gagggggctg 
ctacgaaaag 
tgccaatggt 
ggaagtggtc 
tgtacgggca 
gctgaccatg 
gcactttaac 
aggagctgta 
ccaggagtcc 
cttcaggacg 
agtaatttac 
cctaatcact 
ccaggggttc 
cacctcaagt 
actaacacca 
gtgcagcaaa 
tgctctagac 
ttaaattaaa 



caagatccgg 
cacatggtgg 
tgcgtcaaga 
ttggaggccc 
aacgtcacag 
ttaaaaaaga 
gcggccgggt 
aagcagagga 
gtgtggcaaa 
aacgttcctt 
ccccagagac 
gctgccgcag 
ttggctttga 
gtatatcaag 
cagatcccct 
aaaaggcaag 
agatgacact 
ggagttagaa 
gttcctgcca 
gcacaccaca 
tgttatagag 
aaaaaaaaaa 



60 

120 
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360 
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<210> 77 

<211> 1869 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2660038CB1 



<400> 77 

cagggacgct 

ggctcgagga 

cttcttcctg 

gctcccgtcg 

ttgacctcag 

ccctgcttgc 

ccctgacacc 

aggccttccc 

gggtcagttc 

gcggggcctg 

gagaactgca 

tcaccatcgg 

tgtaccagtt 

gctacctgca 

tggctggggt 

aggcacaggc 

aggcgctaac 

gcggcctgcc 

aggacctcct 

ctgccatgat 

caaggctcta 



cattcctggc 
aatcaccggc 
cctccctggc 
cccgccatcc 
agggtcctgc 
cagctcttga 
cttgactgca 
cctgtcttcc 
ctgctacagc 
tgtattcacc 
ggcccctggc 
caaggtggat 
cctgatgaat 
cacagccgaa 
catgggagcc 
agcctcagaa 
cgagattggc 
ccgagttatc 
gagccagtgg 
gagtggcatc 
caaccagccc 



cagagtcctg 
tcttctcccg 
ttccatctcc 
tgagccatcc 
cttaagcttc 
ctcccaaatt 
aatctaaatt 
aaccccgctt 
tatcccacca 
aatcccctgg 
acataccagc 
ggccttgctg 
ggcatccgac 
gcgacccaca 
tacttgggga 
attgctgtag 
cagaaacatg 
gtcggttcct 
gagatctttc 
gcagccgtcc 
acagacgcac 



acttctttct 
gatccttttc 
caaccccgcg 
cacctgcaac 
tcaccagaat 
ccagctgacg 
cttatcttct 
tctgacaccc 
tggacttctt 
aggtggtgaa 
ggcactaccg 
ccctgcagaa 
tgggcaccta 
gtcctgcccg 
gccccatcta 
ggcaccagta 
gtctggtggg 
ccacccagct 
ctccccagag 
tggccatggc 
agggcaaggg 



cggatcgaga 
ttctctcact 
attccctcct 
cttctgtctt 
ctcctagatt 
tttgaccact 
gcaacctgta 
atttttactt 
gatgagtggc 
gaccaggatg 
aaatgtcttc 
aggcctggcc 
tgggctggct 
cagcgcagca 
catggtgaag 
taagcatcag 
gttatggcgc 
gtgcaccttc 
ctggaagttg 
accctttgac 
cctcatgtac 



tctcgttgct 
tgctggctztt 
ctactcccgt 
ttgcccctcc 
tctatctcct 
tgacatttga 
ctgctgaaac 
tcttactctt 
ctggcagcct 
cagttgcaag 
catgccttca 
cccgccctct 
gaggctgggg 
gctggggcca 
acacacctgc 
ggcatgtttc 
ggggctctgg 
tcatccacca 
gcgctggtgg 
gtggcctgca 
cgggggacac 
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tggacgctct 
gtgcctccta 
tgcgctccct 
agcactcctt 
ctttcatcca 
actacaagag 
tccctgggta 
tcccactcac 
ccctccaaaa 
ctgttagatt 
aaaaaaaaa 



gctgcagaca 
cttccgcctc 
ctactacaca 
ggccacttgt 
tccatggggg 
atgagtttcc 
actcacaccc 
cactgccacc 
ccaaagggaa 
caaaggcaca 



gctcggaccg 
ggcccccaca 
gacactaaat 
gcctccacca 
acagecaacc 
cttctttcct 
ctctctcagg 
cttgagtccc 
tcgagagacc 
gagattatat 



agggcatttt 
ccatcctctc 
aacagccgct 
ctatgtcctg 
ccactcccca 
tgggtgttgc 
gccgaacgag 
tcctgctccc 
caattcccag 
tgattataaa 



tggcatgtac 
cctcttcttc 
ttcccagtct 
gtgactactg 
tctgttctca 
tttaaacctt 
tcatcccaaa 
atgcacagtt 
gcgtctggga 
gcaagtttat 



aagggtatag 
tgggaccagc 
ccaccaaatg 
attaggtgac 
gggttgaate 
ccctacccat 
gtgtatttcc 
ttaaactcct 
cccaggtgtc 
tctgaaaaaa 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1869 



<210> 78 

<211> 1881 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 



2670745CB1 



<400> 78 

gaagaaccga 

cgggttctag 

atagcagagg 

tgttacggtg 

ctgtcgggca 

tgcgtcagat 

cggtgaagtc 

caaccgtcct 

cctgcaatgt 

cagggacagt 

acatttttgg 

ataccagcac 

acttccactc 

gctttgggcc 

ccagaactga 

gccactcggc 

tgaaccggca 

ttgaaccgaa 

acaagattgt 

ttgaccttat 

tgtgcaaact 

ggtgggagct 

atgggtagga 

ctgcccatct 

gttgttgggc 

gggctaggtt 

cacctccttt 

gaagggaagg 

cctccccctt 

aaggagttgc 

gcagataaat 

ttttataaaa 



gcttggctgt 
gctgcaggca 
cagcaggccg 
gacagtgcac 
tcgggtatac 
agatgtgcac 
tgccatccgt 
catcgacgac 
gctctatgcc 
tcctggggcc 
gggctacgag 
catgacatgg 
agccacaatg 
attccattcc 
ggcttggctg 
ctttggctac 
cttccatgac 
ggggaagggg 
cctctttggg 
agatcattct 
ggccgtgatt 
gaatgccatg 
ggaagtttct 
gtcacccacc 
gagaggtgtt 
cctcccccct 
cagctgctcc 
gaatggggag 
tgcctatccc 
agctgttggc 
cccacccttc 
attaattaaa 



gtttatctcg 
gctcgaggac 
tgccgggggg 
ctggagggcg 
tccttcgggg 
attttcaatg 
gggcaagctc 
acagtcctcc 
tttgacgtca 
cgggatggac 
cagcaggcgg 
actcttatct 
ctgggaagtc 
aacaatgaga 
gactgtcccc 
aatggggagc 
ctctggaagt 
ccatgtcccc 
ggtaccagtc 
gacttacaca 
cagtataacc 
accaccaaca 
gccacctccc 
tgctcctttg 
ctctgtgctg 
tgggccgagg 
tgggcctcag 
aagggagaag 
ctcccctctg 
atgagacctc 
cttgagctgt 
a 



ttggggacta 
ccgcggcccc 
gcatgttgct 
ggccccgcag 
gttactgctc 
cagtgtcctt 
ctgtggtacc 
tttggggcgg 
atacgcacaa 
attcagcctg 
actgtttttc 
gtacaaaggg 
acatgtatgt 
tttactgcaa 
cgactccagt 
tgtacatctt 
ttaatcctgt 
gccggcgcca 
catctcctga 
ttttggactt 
tagaccagtc 
gcaatatcag 
ctcctgagcc 
acccctggac 
tgaattcagt 
gccccttccc 
ctctgcccag 
caagcagtgt 
cttgagcctt 
cttctccccg 
cgctgtactc 



aggcgtcggt 
gccccggctc 
gtaaccagtg 
ggtgaaccat 
tggtgaagac 
gcgttggaca 
ctacatgcgc 
gcggaatgac 
gtggttcaca 
tgtcctaggc 
caatgacatt 
cagccctgca 
ctttgggggc 
ccgcattcga 
gctgcctgag 
tggtggttat 
gtcctttacc 
gtgctgctgt 
ggaaggcctg 
tagccctagt 
ctgtttgcct 
tcgccccatc 
tgctgtcatc 
ttggtatacc 
ggggagcrgt 
cttggtgctc 
ggccagccag 
ctgagcctca 
gagccttgac 
tcttggggag 
tgaagttcag 



tggcgcgcaa 
ggcctggcag 
gcccagggga 
gctgcagtgg 
tatgagacac 
aagctgcccc 
tatggacact 
accgaagggg 
ccccgagtgt 
aagatcatgt 
cacaagctag 
cgctggaggg 
cgtgccgacc 
gtctttgaca 
gggcgccgga 
aatgcaaggc 
tggaaaaaga 
attgttggtg 
ggagatgaat 
ctgaagactc 
catgatatca 
gtctcctccc 
ttcactgccc 
tccatgtgga 
agcggggtgg 
tgtccccatc 
gttctgctgg 
ggagcttccc 
tgggagctga 
gtggggacca 
ccagctcaga 
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<210> 79 

<211> 2004 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No; 2676443CB1 

<400> 79 

aatcgcaggg ggcgtggcct agcggtcccg cccccggggc gcgcgcgcgc attggctgtg 60 

cggggtgcgg gcgcgcgggc ggcgctttga accgggcgcg gggcgcgggg cgcggggcgc 120 

tgcggccggt acacgccggg gtagggccgg ggtcgggttg tggtcgggcc gggattgggc 180 
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tctcctgggc 
cctgcggggc 
agcaggaggt 
tcctggcact 
ctcggttcgc 
tccccttcca 
ggacgcccgc 
agcggcgcaa 
tcaggagact 
gccagctgtc 
acaacatctt 
gccagcccct 
ttttccactg 
-tcccagctgg" 
tacgcaaaaa 
caggccagac 
acgacctccc 
gctggctgag 
ggctaccagg 
ctcaggggaa 
tattatccct 
ggacttgggt 
cacttcgtgt 
gcgagacccc 
agggagtggc 
cccgcgcacc 
actcaccagc 
cccccccgct 
ttctccccga 
ggcttcaccc 
ggagctgcca 



catggcagcc 
cgcctgctgc 
gaagctgcgc 
ctacagcggc 
catctacgag 
cgagaaggtg 
agacttggtc 
ctacgcccat 
gttctcgggt 
ctgctacgac 
cactcacttt 
ggatgtgctg 
cgccgtggag 
■catccgc'ctc" 
ctttggcatc 
acgctaggtt 
tggccgtggc 
ctcctcctgg 
ctctcccggc 
caggggctac 
gcctcctgcc 
ggcagagtgt 
ctgctgagag 
gcagctgggt 
ccagggtggt 
gctgcctgcc 
aggcggtctc 
tggctccctg 
caaggagtcc 
cacctgggaa 
aaaaaaaaaa 



gaggcgcgcg 
acgcacccgc 
atgacgggca 
ctgagcgcct 
actgtgcggg 
ttgctgggct 
aacgtcagga 
gcgctggatg 
gcaaccatgg 
caggccaagc 
gtcgccagct 
aagactcgcc 
acagcgaagc 



tgtcgcgctg 
tggacctgcc 
tggcgctgcg 
cgctgtgcag 
accgcgtggc 
ccgtcagcgg 
tgcagaacga 
gcctgtaccg 
catccagccg 
agctggtcct 
ttattgcagg 
tgatgaactc 
tcgggcctct 



atcccccaca 
aaagtgccat 
cttccaaaga 
cacccatcct 
cctcgtcccc 
tgggcactgc 
cagaggctga 
cccgatgccc 
gggtgcagcc 
caaccagacc 
gggatgaaca 
tctggagcca 
ctgcagtggc 
tgtcttcagg 

ggggaatggc 

gacggggcgg 
gggcagtgtg 
aaaa 



ccgtgctcac 
cctgaccagc 
gtcccaagcc 
ccgcagcagg 
tctcagctgt 
gtggccttgc 
tttctcccct 
aaagcagcat 
tggctgttgc 
ttccatgtcc 
agcaacgcag 
ttgtgggtga 
tttaacagtt 
gattgtgcct 
ccaggcgggc 
atgctgcatc 
ctctgtgggg 



gtacttcggg 

caaggtgcat 

ggtggtgcgt 

acagatgacc 

caagggcagc 

tttagctgga 

cgtgaagctg 

cgtagctcgt 

aggggcctta 

tagcaccggg 

tggatgtgcc 

caagggggag 

q qccttt tac, 

ttttgtgttt 

cgtgggaatg 

cagcacctgc 

cccctgctgt 

agctgcacca 

ccctctcccg 

ctcctgggcc 

cttccagcac 

tcacccaagt 

tcgggcagct 

accacaagcg 

gggtcgaggg 

agttttgcca 

gcgtccctcg 

cgcggttcct 

ctctgcctcc 

gctgcaatca 



gggccggcct 

ctgcagacgc 

accgacggca 

tactccctga 

caggggcctc 

ggcttcgtgg 

ccccagggtc 

gaagagggtc 

gtcactgtgg 

tacctctctg 

acgttcctgt 

tatcagggcg 

_aag.gg.cc.tcg_ 

ctggaacagc 

gctgggctgc 

tcctggggcc 

ccccccacct 

cccccgctct 

ctggcagctc 

aggggagggg 

tttccatcga 

gctagctctg 

gcaactcccc 

agtgcctggg 

ccaccgaggt 

aagcctctcc 

ggcacctggg 

ccttagggcc 

ctggtcgctg 

ataaatgccg 



<210> 80 

<211> 3555 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mi sc_ feature 

<223> Incyte ID No: 3295764CB1 



<400> 80 

gaccgtgggc 

aagtgcctga 

cctcccctcg 

gcggccactg 

gcgctgctct 

ccgtacctgc 

ccagtatgga 

ctccgttata 

ctgctctatg 

acagccactg 

cagaaagtca 

ctagggcaaa 

cttacctgtg 

ctcttctttc 

ggtggcattg 

tcaaaaatcc 

cgtctccttg 

cttctctgct 

tacacacagg 

ggcgtggagg 

aaaatatcct 

gctgcagtgt 

gtcttcagaa 

ctcagcatgg 

cagacgctgc 

cagcttttga 



gagtccagaa 
ccctcactcc 
cgccccggat 
tgctcctgcg 
gcgcctacgg 
tggggccgga 
cttactctta 
aacctgttgt 
cccagggact 
aaattgccta 
caagttactg 
tccttgtctc 
tttcagtggc 
accacattcc 
ttactgacac 
ctctaaatat 
tattgaaagt 
ggtctgtgtg 
gcctgtggga 
ccgtttcaac 
ggtcaacttg 
atatcatgga 
tcatctacat 
aacgctatgc 
tcactctaat 
tctatgccag 



cgtcctggcc 
agttggcgga 
ggatgtgccc 
gaccgctcgg 
cttcttcgcc 
caagaacc tg 
cctggtgcta 
tctactgcag 
gctggccatt 
ttactcttat 
tcgaagtgcc 
agtggcaggc 
ttttgctgtg 
ttctacctgc 
cccagcttct 
ggaggagcct 
actatggaat 
gtgggccctc 
gaaagtgatg 
cttactgggt 
gggagaaatg 
cactgtgggt 
gttactcacc 
cctagtactt 
tgtggtagat 
ttattttgca 



ttacagggag 
ggaggagaag 
ggcccggtgt 
gtccgtcgcg 
agccccaggc 
accgagaggg 
ctgtttcctg 
gggctcagcc 
caatttctag 
atctacagtg 
actttggtgg 
tggtcgctgt 
gcctggtttt 
cagagagtga 
aaccaccttc 
cccgtggagg 
gacttcctga 
tctacctgtg 
ccttctcgct 
gccgttgctg 
acactatctc 
aacatttggg 
acgatagcaa 
ggtgcaaata 
gccagcggcc 
ctcatcgccg 



aaggcgtcac 
gaaggggccg 
ctcggcgggc 
aatgctggtt 
cgtccgagcc 
aggtcttcaa 
tgttccttgc 
ttattgttac 
aattttttta 
tggtggacct 
gctttacagt 
tcagcctgaa 
tacctatgcc 
atggcatcaa 
ctggctggga 
aaccggaacc 
tgtgccactc 
gctattttca 
acgctgctat 
tgtttgcagt 
tcttttctct 
tgtgccatgc 
cttttcaaat 
ccttcactgc 
ttggactaga 
cggtccccct 



tcgcggttac 
ggccgggtcc 
ggcggcggcg 
cttgccgacc 
cttcctgacc 
tgaaatttat 
cacagactac 
atggtttatg 
tggcatcgcc 
gggcatgtac 
gggctctgtc 
tgtcatctct 
acagaagagc 
ggtacaaaat 
ggacattgag 
caagccagac 
ctctcgccct 
agttgtgaac 
ctataatggt 
tggttataca 
cctgattgct 
accctatgtt 
tgctgcaaac 
cctggcaccg 
aattaccact 
ggccagtggt 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
_9.60— 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2004 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 
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gcagtcagtg 
accacttcat 
tgcctggatg 
acttcacagc 
aggagccaga 
ttttttctga 
acatggtgtt 
gtatctactt 
gttgcaagtc 
tcttatttct 
atatttgttc 
ttattttaaa 
tctgtaaaat 
atttagttag 
taaggtacgc 
aagttcatat 
ccaaaataaa 
aatttgtttt 
ttcataaact 
tcctgcattt 
tgcttaaaat 
taattctgaa 
aaagctgact 
ttaacttttt 
gtcgacatac 
aggtcatttt 
gaagggaaag 
ttcttttttt 
gaaaagtgct 
ttcatgttac 
gagccttttc 
gaatgctgaa 
Lttgttgtac 
ttttacagta 



ttatgaagaa 
aatatactgc 
tatttgattt 
agccacttga 
agtaaagttt 
ttcaaaagtg 
gtatacttca 
agccagatgc 
atgatggatg 
gtgttactca 
tggtttagtg 
aaaattcatc 
ttgtttagaa 
aaaataaggc 
ttgcttgtct 
ataggctaaa 
atgaaaaatc 
tcttatccaa 
gagcctcttt 
ttttacctta 
tgctagtgat 
atcagtctac 
ttgttgcctt 
tagaatgaca 
tcatttggtt 
ctattcacca 
ggaagtgata 
gctttgtttt 
gtttgaggga 
caaagcaata 
ttatataaga 
ctcaatgtgc 

tattt 



atgtagaaag 
tgaagggctt 
tttaaagcgt 
ctaatacctt 
atttcatgga 
aacttgattt 
ttagcaagtg 
ttggcctggt 
gaaatatgtt 
tgtacattgg 
ttctgtattt 
atatgaatgt 
gtacgttttt 
aagttttgaa 
ttaaatcttc 
gatgtaacta 
tttttttaat 
gtaaagatgt 
tcaattattt 
aatctgcctt 
ttcattacca 
caaggggctg 
atcttgaaaa 
cagctgaaca 
taagcaggaa 
tcctttacta 
tcataaaagt 
gtttgtgcta 
ttttattaaa 
tttccctgga 
caagaatatg 
aagttgtact 
attctgtata 



ctggaagatc 
cttcttatag 
agacatatat 
gtgttccggg 
ttatttatga 
taaaaaccag 
agtttggtgt 
gggaccaggg 
ctaacagcat 
tctttctaca 
taaggtgtac 
tcttggttcc 
gcattattca 
gcctgctaaa 
aatatgaagg 
ggccatttgt 
aatttcatcc 
caataggaat 
caatgggaca 
tgtttctaaa 
agttacttga 
ctagtatatg 
gaatctagat 
gtgttaataa 
tcctagaagc 
ttaagggaaa 
agcaaccttc 
atttgggaat 
tcttttttta 
atttaattca 
ttcacatctt 
gtatgcagtt 
gattataaaa 



cacaatcaag 
caagaactct 
ttatgaatgt 
aataacatga 
gagctaattt 
tcaagagcaa 
tttataggtc 
ctccacagag 
ctgcctctat 
tagttattct 
gtatcatctc 
cattgtgacg 
tatgcttccc 
tgaagagact 
actattaatt 
atttgtattc 
ctatttatag 
tgcattagtc 
ggaactagga 
ggtagatcat 
aaaaatgttc 
tcagacatga 
aggtgctttt 
tagtgtgtca 
aaatggatgg 
ggaaaagaac 
attttacatt 
tgtgtactcc 
atggaatgtg 
aagtttgtgg 
ggtatgtggc 
ttgtaaataa 
ttatttttat 



ttctcaagta 
gcacagcaac 
gcatttcttg 
tactattcag 
aaggatgact 
tcaaagcagc 
acatatgtct 
gccacaaaat 
tcaatttaat 
atcactggca 
taattttaag 
attatttatt 
agagaagctc 
taagaaagct 
ccaagattaa 
ccttttattt 
tttttatatt 
caaggccttt 
tagatgtgat 
cttgaatatt 
tatatgcatt 
aaactatttt 
aactggggta 
agattgcaaa 
ggataagaat 
actagctaag 
ctgtctgttt 
gaaacaagta 
gtacaaattg 
catacaacct 
catatttata 
gtgaaaataa 
taaataaata 



<210> 81 
<211> 1293 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3438320CB1 



<400> 81 

atatgcccca 

ccctgatatt 

ataaacgctc 

ctgaggagca 

ccggggcctg 

ggaggtggcc 

cgacctaacg 

agttgcaggc 

catagatccg 

tccgcctcag 

gtcgtattat 

gctcacgtcg 

acaccgctat 

gaacctgaac 

tgactatctc 

tgatctcacg 

agtttggctt 

atctccactc 

tgccttcatt 

aagatcaagt 

tgtttgatag 

aatgatggaa 



cgcgggactc 
tatctcagtg 
tcccctcggc 
aaggagtaac 
gtggctgggc 
caacataaca 
tcattggcac 
caggatatca 
ctgaccggct 
ctgccctgtt 
gaggtggggc 
caggagcaca 
ctcccctata 
atggatttta 
agtatggacg 
gagttgtagg 
tttctgtgcc 
ctctccagga 
acagtttgct 
accttggttt 
ttaagagaga 
a tgagatgtc 



atactacgtt 
gacggtggcc 
tcccggacgc 
caagagatcc 
cagacttgga 
ggcccgaaga 
aggaatacaa 
gccactggtt 
gcctgaggta 
cggactgggc 
ggctgtctgc 
cactggaggt 
actcacatgc 
ccctggaaga 
gtacacttca 
caaggagatg 
ttgaggaaaa 
gctagcctgt 
ctgagaaaat 
agggagatgt 
gtagttctac 
tgcaggagat 



tcccgtgaac 
ggaaaaggac 
gacggaggtc 
agtgaccgac 
gtattttcag 
cctctgggta 
ggggaacctg 
tgatccaaag 
ctgcaccccg 
caacgatttt 
caagacccgg 
gggggttctg 
tgccagctac 
gaatgggatc 
cacacctgca 
tacactcgtg 
gtggtggggc 
gcccttctga 
tagtgaatta 
agaagaggat 
aggggtgagg 

ggg 



acgtgcagtc 
aatggtttcc 
gtagtagtag 
agagcaagag 
cgtcgctatt 
tcttacctgg 
ctgctgaaac 
accagagaca 
cggggccgct 
gggaagccct 
agcatccgca 
gagtccatat 
acgtggaaat 
cgggatgagg 
atacttctgt 
tagactcaag 
cgaggggtgc 
agtgtaaagg 
atctttggga 
agtcagagtt 
gatggaagga 



caaaccccgc 
atgtcagcgg 
tgagtacgtg 
ccatgccgcg 
tcacgccggc 
gacgcgtgta 
ccatcgtgga 
tccgcaagca 
ttgtgcacgt 
ggcggcaggg 
tcattaacac 
gggaaatcct 
atgaagggaa 
aggaagaatt 
acttcaatga 
acgtatttcg 
ctggacccag 
ccctattccc 
atgatacaag 
caggcagaac 
cttttttggc 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3555 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1293 
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<210> 82 

<211> 1489 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 3986488CB1 



<400> 82 

cggggctggc gggcggcgct cttctacggg acctgctcct tcctcaccgt gcttgtcaac 60 

aaggcgctgc tgaccaccta cggtttcccg tcaccaattt tccttggaat tggacagatg 120 
gcagccacca_t^Mt^tj^t_ja^^ 

tttgataaga aaattcctgt aaagctgttt cctctgcctc tcctctacgt tggaaaccac 240 

ataagtggat tatcaagcac aagtaaatta agcctaccga tgttcaccgt gctcaggaaa 300 

ttcaccattc cacttacctt acttctggaa accatcatac ttgggaagca gtattcactc 3 60 

aacatcatcc tcagtgtctt tgccattatt ctcggggctt tcatagcagc tgggtctgac 420 

cttgctttta acttagaagg ctatattttt gtattcctga atgatatctt cacagcagca 480 

aatggagttt ataccaaaca gaaaatggac ccaaaggagc tagggaaata cggagtactt 540 

ttctacaatg cctgcttcat gattatccca actcttatta ttagtgtctc cactggagac 600 

ctgcaacagg ctactgaatt caaccaatgg aagaatgttg tgtttatcct acagtttctt 660 

ctttcctgtt ttttggggtt tctgctgatg tactccacgg ttctgtgcag ctattacaat 720 

tcagccctga cgacagcagt ggttggagcc atcaagaatg tatccgttgc ctacattggg 780 

atattaatcg gtggagacta cattttctct ttgttaaact ttgtagggtt aaatatttgc 840 

atggcagggg gcttgagata ttccttttta acactgagca gccagttaaa acctaaacct 900 

gtgggtgaag aaaacatctg tttggatttg aagagctaaa gagtctgcag caggattgga 960 

gactgacttg tgactgcggg ctgggggggc attcccagta ggaatgtgaa gccagaggtt 1020 

tcggattcgt gacatccacc ccctgggcaa gtgagagcat ctgcaaaatg caaagagaac 1080 

tacctcatat gcaggatgag ccaatggcag tctcaagaaa tgtactcggg cgacacctta 1140 

cctgtggaaa gcaaatcttt tcaaaataag ccactgggac tcggtaggtg gagccccagc 1200 

tgctcttcta gggacctatg gggccttcgt ggcatctctg tgctgtgtgc tggggaggag 1260 

gttgatgtaa tggtgactct tttctgatca gcaccttggc cgtgattccc aaggtcccag 1320 

ccaaagcaaa gggccagttg tttcagttta aacagacatg tctttagtct aataaaatta 1380 

gttaactgcc agtaaagtta tttgttagct ttgatgaaag ctatgttggt atctttccct 1440 

aatcatcaaa gtaaataaaa aatcatttct atgtaaaaaa aaaaaaaaa 1489 

<210> 83 
<211> 927 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incytie ID No: 4378816CB1 



<400> 83 

ctccctgctt 

gcgctgagtg 

tggcggcggg 

gggcgctcat 

tctacttgtc 

aagaatatgg 

aaattctgac 

aagaaaaata 

gctggatgac 

ggctgtactg 

tgctactgtg 

tgaagaagtt 

aatgaatcaa 

tgtattgttg 

tattgttgtt 

taaaatgttt 



tcctctgccg 
ggagctgggg 
ctgcgccctg 
ctggctctgc 
tttagtagga 
caaagctgat 
cgtcgccctg 
ttaccggcat 
cttcctccca 
ttggctttac 
gcagtcatgg 
tcagtgaact 
atctttttgt 
ctcctgaaca 
caattaaatg 
aattatacaa 



catggtcctg 
gccgaggctg 
ggcctgcgcc 
tacgacgcgc 
aacgttgcaa 
gcaagatggg 
gatgggtctc 
ttcctgcaga 
gagtggctca 
ctgttttttt 
ctagaactca 
ttcaaaacca 
ttggccaaaa 
acctgtttca 
gtgatatagg 
aaaaaaa 



ggccgttggc 
gcggttcgct 
tgggccgcgg 
tggtgcactt 
attccgatgg 
tttattttga 
tggcattgtt 
tcaccctgtg 
ccagaagccc 
ttaacggtgt 
agaaaatgca 
taaacaccat 
tgtaatacat 
aattggtttt 
gaaaagagaa 



gtcggaagcc 
gctgctgtgc 
gcagggggcg 
cgcgctggaa 
cttgattgct 
tccaaccatt 
cctcatttat 
cgtgtgcgag 
caacctcaac 
gtgggttctg 
tcagaaagaa 
tatctaactt 
tccagtctac 
aaggcgacca 
caaatttgaa 



tgaagcatgg 
gccgcgctgc 
gcggaccgcg 
ggcccttttg 
tctttatgga 
gtgtctgtgg 
gccatagtca 
ctgtatggct: 
accagcaact 
atcccaggac 
accagttcag 
catgaaccag 
actttgtttt 
gttttcgttg 
tttgtaacaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

927 



<210> 84 
<211> 970 
<212> DNA 

<213> Homo sapiens 
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1 



<220> 

<221> misc_feature 
<223> Incyte ID No: 



4797137CB1 



<400> 84 

ggatgcagca 

tcctgttctt 

tccgaggtta 

tcactgacaa 

caatatttca 

tttcctgggt 

taaaggacaa 

ttcccatgac 

cccttctttc 

gaatggggag 

ctattgaggt 

y ty tCCy tty 

tatgacacaa 
ctgtcagaga 
taaagacttt 
tcaggaagtc 
tgaagaaatc 



gagaggagca 
ccagggtgtt 
tgttggagaa 
gcttactata 
ttatcagtct 
tggaaatgta 
tgggacattc 
agagctaaca 
catccttgtc 
gaaggctgct 
ttccgatgac 

CyCtyaytyC 

gaagagtcac 
aagtggagct 
aggaaccact 
tcataagaga 



gctggaagcc 
.tatatcgtct 
aagatcaagt 
gactggacat 
ttccagtacc 
tacaaagggg 
agctgtgctg 
gtcacagaaa 
tttgtgccct 
gggctgaaga 
actgatcagg 

ctaaagacag 
ggcctggacc 
tatttattga 
ctcatgactt 



gtggctgcgc 
tttccttgga 
tgaaatgcac 
atcgccctcc 
caaccacagc 
atgcatctat 
tgaagaatcc 

ggggttttgg 

cagccgtggt 
agaggagcag 
aggaggaaga 

ac tatyaay a 

gaaacatccc 
atagcgatgg 
ataaatgttc 
cccctttcaa 



tctcttccct 
gattcgtgca 
tttcaagtca 
cagcagcagc 
aggcacattt 
aagtataagc 
cccagatgtg 
caccatgctt 
ggttgctctg 
gtctggctat 
ggcgtgtatg 

ydCatatuya 

attccactgg 
acaatcctgg 
ttgttgtatt 
tgaattatgc 



ctgctgggcg 
gatgcccatg 
acttcagatg 
cacacagtat 
cgggatcgga 
aaccctacca 
caccataata 
tcctctgtgg 
ctgctggtga 
aagaagtcat 
gcgaggcttt 
tgaaagtctg 
cagctaaagc 
agatcatcag 
tataaactgt 
tgtaattgaa 



<210> 85 
<211> 594 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 5470806CB1 



<400> 85 

gacaggatgg 

gaagctggcc 

gctgtccgag 

ggaagctggg 

accacagatg 

tggaaggctc 

gactctggtc 

actgctgtcg 

ccaaagcagt 

caactcggga 



cttcccttcg 
ctgggggtgc 
gcagccctgc 
agccgtttgc 
agaagttcac 
ttggcatttc 
atcgccacta 
tcagtaaccc 
agcttcccat 
agatgccgtg 



cctgttcctc 
tggagaatcc 
tgttgatgtg 
ctctgggaag 
ggaaggggtg 
cccattccat 
caccatcgca 
ccagaactga 
ttgtactgaa 
aaacgatctt 



ctctgcctcg 
aagtgtcctc 
gccgtgaaag 
accgccgagt 
tacagggtag 
gaatacgcag 
gccctgctca 
gggacccagc 
acagtgttct 
attaaaccac 



ctgtactgat 
tgatggtcaa 
tgttcaaaaa 
ctggagagct 
aactggacac 
aggtggtttt 
gcccgtactc 
ccagtaggac 
tgctctataa 
ctgtgatgcc 



atttgcgtct 
agtcctggat 
gactgcagac 
gcacgggctc 
caaatcgtac 
cacagccaat 
ctacagcacc 
caggatcttg 
accgtgttag 
aaaa 



<210> 86 

<211> 618 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mi sc__ feature 

<223> Incyte ID No: 5473242CB1 



<400> 86 

gtgttgactc 

tactgttaat 

cagtctgctg 

ctctgtctct 

cgccttcgat 

tgaactccac 

gattgtgatt 

cttccagaag 

ttttcctgct 

ttgtcaaaat 

tgttctgttg 



gcaacctcag 
ggcctgtggg 
attgtctacc 
gctatcatgg 
gatggcctga 
tgtgacaagc 
atgatgggcc 
gtggtggctg 
cttgtctttg 
gacaaagacc 
agataccc 



gaacagacac 
gaaaggtgaa 
cttggaccca 
gtaaccccca 
aacacttgga 
tgcatgtgga 
accacctggg 
gagtggccag 
tgcaatggcc 
tttgaaaatc 



catggtgcac 
ccctgttgaa 
gaggtacttt 
ggtgaaggcc 
caacctcaag 
tcctgagaac 
caaggaaccc 
tgccctggct 
aattgttccc 
tgtcctacta 



ctaactgatg 
attggcgctg 
tctaaatttg 
catggcgaaa 
ggcacctttg 
ttcaggctcc 
accccgagtg 
cacaagtacc 
aagagagctt 
attaaagcat 



ctgagaaggc 
agtcccttgc 
gggacctgtc 
aggtgataaa 
ccagcctcag 
tgggcaatat 
cacaggctgc 
actaaacctc 
ctgtcagttg 
ttggttcaag 
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~7*m 

780 
840 
900 
960 
970 
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420 
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60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

618 
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